noordhollandsdagblad.nl
robots.txt

Robots Exclusion Standard data for noordhollandsdagblad.nl

Resource Scan

Scan Details

Site Domain noordhollandsdagblad.nl
Base Domain noordhollandsdagblad.nl
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-03-25T20:15:53+00:00
Next Scan 2024-06-23T20:15:53+00:00

Last Successful Scan

Scanned2023-05-25T10:58:05+00:00
URL https://noordhollandsdagblad.nl/robots.txt
Redirect https://www.noordhollandsdagblad.nl/robots.txt
Redirect Domain www.noordhollandsdagblad.nl
Redirect Base noordhollandsdagblad.nl
Domain IPs 104.16.219.40, 104.16.220.40, 2606:4700::6810:db28, 2606:4700::6810:dc28
Redirect IPs 104.16.219.40, 104.16.220.40, 2606:4700::6810:db28, 2606:4700::6810:dc28
Response IP 104.16.220.40
Found Yes
Hash 58f7f6334c4cd0a6865b753b88940ccb2642e926fcf75b6c8251a651eabce15e
SimHash 8df4464cfb80

Groups

*

Rule Path
Disallow /gva/
Disallow /gva-mobile/
Disallow /messagent/
Disallow /extra/messagent/
Disallow /utils/
Disallow /account/
Disallow /LoadTest/
Disallow /api/
Disallow /krant/
Disallow /krant/archief