whatstheharm.net
robots.txt

Robots Exclusion Standard data for whatstheharm.net

Resource Scan

Scan Details

Site Domain whatstheharm.net
Base Domain whatstheharm.net
Scan Status Ok
Last Scan2025-08-18T04:47:31+00:00
Next Scan 2025-09-17T04:47:31+00:00

Last Scan

Scanned2025-08-18T04:47:31+00:00
URL http://whatstheharm.net/robots.txt
Domain IPs 66.175.58.9
Response IP 66.175.58.9
Found Yes
Hash cc2d075a79f8bbdf22e1284cfe014332c32f87db96137a03a72ad943e2bb2523
SimHash a924e804eb11

Groups

*

Rule Path
Disallow /cgi/
Disallow /Templates/
Disallow /logs/
Disallow /graphics/

googlebot-image

Rule Path
Disallow /*.gif$

Other Records

Field Value
sitemap http://whatstheharm.net/sitemap.xml