nhd.nl
robots.txt
Robots Exclusion Standard data for nhd.nl
Resource Scan
Scan Details
Site Domain | nhd.nl |
Base Domain | nhd.nl |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-09-21T10:29:02+00:00 |
Next Scan | 2024-12-20T10:29:02+00:00 |
Last Successful Scan
Scanned | 2023-05-24T23:12:06+00:00 |
URL | https://nhd.nl/robots.txt |
Redirect | https://www.noordhollandsdagblad.nl/robots.txt |
Redirect Domain | www.noordhollandsdagblad.nl |
Redirect Base | noordhollandsdagblad.nl |
Domain IPs | 104.21.55.91, 172.67.146.122, 2606:4700:3034::ac43:927a, 2606:4700:3035::6815:375b |
Redirect IPs | 104.16.219.40, 104.16.220.40, 2606:4700::6810:db28, 2606:4700::6810:dc28 |
Response IP | 104.16.219.40 |
Found | Yes |
Hash | 58f7f6334c4cd0a6865b753b88940ccb2642e926fcf75b6c8251a651eabce15e |
SimHash | 8df4464cfb80 |
Groups
*
Rule | Path |
---|---|
Disallow | /gva/ |
Disallow | /gva-mobile/ |
Disallow | /messagent/ |
Disallow | /extra/messagent/ |
Disallow | /utils/ |
Disallow | /account/ |
Disallow | /LoadTest/ |
Disallow | /api/ |
Disallow | /krant/ |
Disallow | /krant/archief |