hoogeveenschecourant.nl
robots.txt
Robots Exclusion Standard data for hoogeveenschecourant.nl
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | hoogeveenschecourant.nl |
Base Domain | hoogeveenschecourant.nl |
Scan Status | Ok |
Last Scan | 2024-06-03T23:04:35+00:00 |
Next Scan | 2024-06-10T23:04:35+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-06-03T23:04:35+00:00 |
URL | https://hoogeveenschecourant.nl/robots.txt |
Domain IPs | 104.18.41.52, 172.64.146.204, 2606:4700:4400::6812:2934, 2606:4700:4400::ac40:92cc |
Response IP | 172.64.146.204 |
Found | Yes |
Hash | 5d7fd51b7be1172cf15f5a1135b14e30b8ab94d896812cdfb96e84a5a9d1014f |
SimHash | 69349a62c1b3 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /most-read |
Disallow | /tag |
Disallow | /search |
Allow | / |
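
To see how a crawler might interpret the group above, the following is a minimal sketch using Python's standard `urllib.robotparser`. The `ROBOTS_TXT` string is reconstructed from the table and is an assumption, not the verbatim file served by the site; `ExampleBot` and the sample paths are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the rule table above (assumption: not the verbatim file).
ROBOTS_TXT = """\
User-agent: *
Disallow: /most-read
Disallow: /tag
Disallow: /search
Allow: /
"""

BASE = "https://hoogeveenschecourant.nl"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making any network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# Hypothetical paths, chosen only to exercise each rule.
for path in ("/", "/nieuws/artikel", "/tag/sport", "/search?q=test", "/most-read"):
    allowed = parser.can_fetch("ExampleBot", BASE + path)
    print(f"{path}: {'allowed' if allowed else 'disallowed'}")
```

Rules are prefix matches, so `/tag` also covers paths such as `/tag/sport`. `urllib.robotparser` applies the first matching rule in file order, whereas RFC 9309 prefers the longest match; for this particular group both approaches should give the same answers, because each `Disallow` path is longer than `Allow: /` and is listed before it.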
Other Records
Field | Value |
---|---|
sitemap | https://static.hoogeveenschecourant.nl/sitemap/sitemap.xml.gz |
sitemap | https://static.hoogeveenschecourant.nl/sitemap/sitemap_news.xml.gz |
sitemap | https://static.hoogeveenschecourant.nl/sitemap/sitemap_sections.xml.gz |
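
The sitemap records above can be read with the same standard-library parser, and because the listed files end in `.xml.gz`, a consumer would likely need to decompress them before parsing. The sketch below assumes the files follow the sitemaps.org 0.9 schema, which this scan does not confirm. `ROBOTS_TXT` is reconstructed from the table, the helper names are hypothetical, and `SAMPLE_GZ` is synthetic data rather than content from the site.

```python
import gzip
import xml.etree.ElementTree as ET
from urllib.robotparser import RobotFileParser

# Reconstructed from the records above (assumption: not the verbatim file).
ROBOTS_TXT = """\
User-agent: *
Allow: /
Sitemap: https://static.hoogeveenschecourant.nl/sitemap/sitemap.xml.gz
Sitemap: https://static.hoogeveenschecourant.nl/sitemap/sitemap_news.xml.gz
Sitemap: https://static.hoogeveenschecourant.nl/sitemap/sitemap_sections.xml.gz
"""

# Namespace of the sitemaps.org 0.9 schema (assumed to apply to these files).
SITEMAP_NS = "{http://www.sitemaps.org/schemas/sitemap/0.9}"


def list_sitemaps(robots_text: str) -> list[str]:
    """Return the Sitemap URLs declared in a robots.txt body."""
    parser = RobotFileParser()
    parser.parse(robots_text.splitlines())
    # site_maps() (Python 3.8+) returns None when no Sitemap line exists.
    return parser.site_maps() or []


def extract_locs(gz_bytes: bytes) -> list[str]:
    """Decompress a gzipped sitemap and collect every <loc> value.

    Works for both <urlset> files and <sitemapindex> files, since both
    store their URLs in <loc> elements.
    """
    root = ET.fromstring(gzip.decompress(gz_bytes))
    return [el.text.strip() for el in root.iter(SITEMAP_NS + "loc") if el.text]


# Synthetic payload standing in for a downloaded .xml.gz file.
SAMPLE_GZ = gzip.compress(
    b'<?xml version="1.0" encoding="UTF-8"?>'
    b'<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">'
    b"<url><loc>https://example.com/a</loc></url>"
    b"</urlset>"
)

for url in list_sitemaps(ROBOTS_TXT):
    print(url)
print(extract_locs(SAMPLE_GZ))
```

Downloading is left out on purpose so the sketch runs offline; in practice the bytes passed to `extract_locs` would come from an HTTP client of your choice.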