novojornal.co.ao
robots.txt
Robots Exclusion Standard data for novojornal.co.ao
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | novojornal.co.ao |
Base Domain | novojornal.co.ao |
Scan Status | Ok |
Last Scan | 2025-05-17T14:57:57+00:00 |
Next Scan | 2025-06-16T14:57:57+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2025-05-17T14:57:57+00:00 |
URL | https://novojornal.co.ao/robots.txt |
Domain IPs | 104.21.20.67, 172.67.191.217, 2606:4700:3030::6815:1443, 2606:4700:3033::ac43:bfd9 |
Response IP | 104.21.20.67 |
Found | Yes |
Hash | 9dda77a48258d67d8138c262605be02a0c0029c0b4bf1546dd31a36c72b5c960 |
SimHash | e80b3c408793 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /common/ |
Disallow | /errors/ |
Disallow | /admin/ |
Disallow | /assets/ |
Disallow | /search/ |
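As a rough illustration of how a crawler might interpret the group above, the following sketch rebuilds an equivalent robots.txt from the listed rules and checks a few paths with Python's standard `urllib.robotparser`. The reconstructed text is an assumption (the exact bytes of the scanned file are not shown here), and the agent name `ExampleBot` and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumption: reconstructed from the scanned "*" group; the real file's
# exact bytes, ordering, and whitespace may differ.
ROBOTS_TXT = """\
User-agent: *
Disallow: /common/
Disallow: /errors/
Disallow: /admin/
Disallow: /assets/
Disallow: /search/
"""

BASE_URL = "https://novojornal.co.ao"

# Parse the rules from memory instead of fetching them over the network.
parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def is_allowed(path: str, agent: str = "ExampleBot") -> bool:
    """Return True if `agent` (a hypothetical crawler name) may fetch `path`."""
    return parser.can_fetch(agent, BASE_URL + path)


# Hypothetical sample paths, chosen only to exercise the rules.
for path in ("/", "/admin/login", "/administrator", "/search/?q=angola"):
    print(path, "allowed" if is_allowed(path) else "disallowed")
```

Matching here is by path prefix, so `/administrator` is not covered by the `/admin/` rule (note the trailing slash), while anything under `/admin/` is.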
Other Records
Field | Value |
---|---|
sitemap | https://novojornal.co.ao/google_news.ashx |
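The sitemap record can be read the same way. This is a minimal sketch, assuming the record appears in the file as a standard `Sitemap:` line; `RobotFileParser.site_maps()` requires Python 3.8 or later, and the helper name `list_sitemaps` is hypothetical.

```python
from typing import List
from urllib.robotparser import RobotFileParser

# Assumption: the record is written as a conventional "Sitemap:" line;
# the scan only reports the field name and its value.
ROBOTS_TXT = """\
User-agent: *
Disallow: /search/

Sitemap: https://novojornal.co.ao/google_news.ashx
"""


def list_sitemaps(robots_txt: str) -> List[str]:
    """Return the sitemap URLs declared in a robots.txt body (hypothetical helper)."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    # site_maps() returns None when no sitemap records are present.
    return parser.site_maps() or []


print(list_sitemaps(ROBOTS_TXT))
```

Sitemap records are independent of user-agent groups, so they apply to every crawler regardless of where they appear in the file.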