craphound.com
robots.txt

Robots Exclusion Standard data for craphound.com

Resource Scan

Scan Details

Site Domain craphound.com
Base Domain craphound.com
Scan Status Ok
Last Scan2025-09-21T15:22:33+00:00
Next Scan 2025-10-21T15:22:33+00:00

Last Scan

Scanned2025-09-21T15:22:33+00:00
URL https://craphound.com/robots.txt
Domain IPs 142.132.196.168, 172.104.232.45, 51.77.117.40
Response IP 51.77.117.40
Found Yes
Hash a9624d971cfb8025b996f94326862c6811af74655bd174be7f6d01f6b7d60f9f
SimHash 1a0a21976612

Groups

*

Rule Path
Allow /resources.html
Allow /
Allow /apt/
Allow /bomblamp/
Allow /boxingday99/
Allow /crap/
Allow /disneyland/
Allow /fic/listing.html
Allow /gerry/
Allow /gipsicon/
Allow /halloween99/
Allow /hse/
Allow /names/
Allow /nonfic/listing.html
Allow /names/
Allow /nycdec99/
Allow /origami/
Allow /philcon/
Allow /philcon99/
Allow /sanfrandisney/
Allow /thanksgiving/
Allow /wdw99/
Allow /xmastree99/
Allow /eyemodule.html

Comments

  • robots.txt file for http://www.craphound.com
  • comments to doctorow@craphound.com