crosswordhouse.com
robots.txt
Robots Exclusion Standard data for crosswordhouse.com
Resource Scan
Scan Details
| Site Domain | crosswordhouse.com |
| Base Domain | crosswordhouse.com |
| Scan Status | Ok |
| Last Scan | 2026-01-22T14:47:59+00:00 |
| Next Scan | 2026-02-21T14:47:59+00:00 |
Last Scan
| Scanned | 2026-01-22T14:47:59+00:00 |
| URL | https://crosswordhouse.com/robots.txt |
| Domain IPs | 104.21.40.187, 172.67.156.76, 2606:4700:3031::ac43:9c4c, 2606:4700:3034::6815:28bb |
| Response IP | 104.21.40.187 |
| Found | Yes |
| Hash | ef637dd71e1d7fa1091eb37041282ce2ac5870983f43d84140b52e950842c97f |
| SimHash | 6918dc70c6d3 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /cgi-bin/ |
| Disallow | /tmp/ |
| Allow | / |
Other Records
| Field | Value |
|---|---|
| crawl-delay | 10 |
Other Records
| Field | Value |
|---|---|
| sitemap | https://crosswordhouse.com/sitemap.xml |