geniemusic.com
robots.txt

Robots Exclusion Standard data for geniemusic.com

Resource Scan

Scan Details

Site Domain geniemusic.com
Base Domain geniemusic.com
Scan Status Ok
Last Scan 2026-02-22T03:16:48+00:00
Next Scan 2026-03-24T03:16:48+00:00

Last Scan

Scanned 2026-02-22T03:16:48+00:00
URL https://geniemusic.com/robots.txt
Domain IPs 206.168.149.26
Response IP 206.168.149.26
Found Yes
Hash 5dae807d9b90d46b0bcc1e83767b4ec05ae2ed03c3a0ffe3b5f34c324745d88e
SimHash ce145111ce65
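
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest of the fetched file, although the report does not name the algorithm. As a minimal sketch under that assumption, a content hash of this kind could be computed as follows. The function name `sha256_hex` and the sample body are hypothetical, and the sample is not expected to reproduce the hash listed above.

```python
import hashlib


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of a response body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body; the real file's exact bytes are not in this report.
sample_body = b"User-agent: *\nDisallow: /bin/\n"
digest = sha256_hex(sample_body)

# A SHA-256 hex digest is always 64 characters, matching the field's length.
print(len(digest))
```

Comparing the digest from one scan with the digest from the next is a cheap way to tell whether the file changed byte for byte.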

Groups

*

Rule Path
Disallow /bin/
Disallow /cgi-bin/
Disallow /data/
Disallow /etc/
Disallow /icons/
Disallow /lib/
Disallow /opt/
Disallow /usage/
Disallow /usr/

turnitinbot

Rule Path Comment
Disallow / Disallows all URLs to turnitin.com, a site which tries to catch students plagiarizing web content in their school papers
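
The two groups above can be checked against sample URLs with Python's standard `urllib.robotparser`. This is a sketch only: the `ROBOTS_TXT` text is reconstructed from the rule tables in this report, so the original file's exact layout and group order are assumptions, and `examplebot` and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the Groups tables; the original file's exact text,
# ordering, and comments are assumptions.
ROBOTS_TXT = """\
User-agent: *
Disallow: /bin/
Disallow: /cgi-bin/
Disallow: /data/
Disallow: /etc/
Disallow: /icons/
Disallow: /lib/
Disallow: /opt/
Disallow: /usage/
Disallow: /usr/

User-agent: turnitinbot
Disallow: /
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without fetching anything over the network."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# "examplebot" is a hypothetical crawler that falls under the "*" group.
home_ok = parser.can_fetch("examplebot", "https://geniemusic.com/index.html")
cgi_ok = parser.can_fetch("examplebot", "https://geniemusic.com/cgi-bin/search")

# turnitinbot has its own group that disallows every path.
turnitin_ok = parser.can_fetch("turnitinbot", "https://geniemusic.com/index.html")

print(home_ok, cgi_ok, turnitin_ok)
```

Under these assumptions, a generic crawler may fetch paths outside the nine listed directories, while turnitinbot is refused everywhere.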

Comments

  • /robots.txt as defined in
  • <http://info.webcrawler.com/mak/projects/robots/exclusion.html>