geniemusic.com
robots.txt

Robots Exclusion Standard data for geniemusic.com

Archived Snapshots

Resource Scan

Scan Details

Site Domain	geniemusic.com
Base Domain	geniemusic.com
Scan Status	Ok
Last Scan	2026-02-22T03:16:48+00:00
Next Scan	2026-03-24T03:16:48+00:00

Last Scan

Scanned	2026-02-22T03:16:48+00:00
URL	https://geniemusic.com/robots.txt
Domain IPs	206.168.149.26
Response IP	206.168.149.26
Found	Yes
Hash	5dae807d9b90d46b0bcc1e83767b4ec05ae2ed03c3a0ffe3b5f34c324745d88e
SimHash	ce145111ce65

Groups

*

Rule	Path
Disallow	/bin/
Disallow	/cgi-bin/
Disallow	/data/
Disallow	/etc/
Disallow	/icons/
Disallow	/lib/
Disallow	/opt/
Disallow	/usage/
Disallow	/usr/

Rule

Path

Disallow

/bin/

Disallow

/cgi-bin/

Disallow

/data/

Disallow

/etc/

Disallow

/icons/

Disallow

/lib/

Disallow

/opt/

Disallow

/usage/

Disallow

/usr/

turnitinbot

Rule	Path	Comment
Disallow	/	Disallows all urls to turnitin.com, a site which tries to catch students plaigarizing web content in their school papers

Rule

Path

Comment

Disallow

/

Disallows all urls to turnitin.com, a site which tries to catch students plaigarizing web content in their school papers

Back to top

Comments

/robots.txt as defined in
<http://info.webcrawler.com/mak/projects/robots/exclusion.html>

Back to top

geniemusic.comrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

turnitinbot

Comments

geniemusic.com
robots.txt