theicebook.com
robots.txt
Robots Exclusion Standard data for theicebook.com
Resource Scan
Scan Details
| Site Domain | theicebook.com |
| Base Domain | theicebook.com |
| Scan Status | Ok |
| Last Scan | 2025-12-02T03:12:20+00:00 |
| Next Scan | 2026-01-01T03:12:20+00:00 |
Last Scan
| Scanned | 2025-12-02T03:12:20+00:00 |
| URL | https://theicebook.com/robots.txt |
| Domain IPs | 104.21.46.31, 172.67.222.238, 2606:4700:3035::ac43:deee, 2606:4700:3037::6815:2e1f |
| Response IP | 172.67.222.238 |
| Found | Yes |
| Hash | b36617c8fc60eed3f7fcb7362179bb38853b1d74f812604469bd50a9d0fde9cd |
| SimHash | 6150c85989d2 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /mapa/ |
| Disallow | /n%C3%BAmero/ |
| Disallow | /biblioteca/ |
| Disallow | /%D1%81at%C3%A1logo/ |
| Disallow | /reveja/ |
Other Records
| Field | Value |
|---|---|
| sitemap | https://theicebook.com/sitemap.xml |