crosswordhouse.com
robots.txt

Robots Exclusion Standard data for crosswordhouse.com

Resource Scan

Scan Details

Site Domain crosswordhouse.com
Base Domain crosswordhouse.com
Scan Status Ok
Last Scan2026-01-22T14:47:59+00:00
Next Scan 2026-02-21T14:47:59+00:00

Last Scan

Scanned2026-01-22T14:47:59+00:00
URL https://crosswordhouse.com/robots.txt
Domain IPs 104.21.40.187, 172.67.156.76, 2606:4700:3031::ac43:9c4c, 2606:4700:3034::6815:28bb
Response IP 104.21.40.187
Found Yes
Hash ef637dd71e1d7fa1091eb37041282ce2ac5870983f43d84140b52e950842c97f
SimHash 6918dc70c6d3

Groups

*

Rule Path
Disallow /cgi-bin/
Disallow /tmp/
Allow /

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://crosswordhouse.com/sitemap.xml