diarioguardiao.pt
robots.txt
Robots Exclusion Standard data for diarioguardiao.pt
Resource Scan
Scan Details
Site Domain | diarioguardiao.pt |
Base Domain | diarioguardiao.pt |
Scan Status | Ok |
Last Scan | 2025-03-29T03:30:14+00:00 |
Next Scan | 2025-04-05T03:30:14+00:00 |
Last Scan
Scanned | 2025-03-29T03:30:14+00:00 |
URL | https://diarioguardiao.pt/robots.txt |
Domain IPs | 104.21.73.46, 172.67.140.149, 2606:4700:3030::6815:492e, 2606:4700:3031::ac43:8c95 |
Response IP | 104.21.73.46 |
Found | Yes |
Hash | ea3b5fe1e9aca961b3504503c4bfd931f827c18c44fbd0494cdc6594647e305a |
SimHash | 4101d8000db3 |
Other Records
Field | Value |
---|---|
sitemap | https://diarioguardiao.pt/sitemap_index.xml |