denhaag.nl
robots.txt

Robots Exclusion Standard data for denhaag.nl

Resource Scan

Scan Details

Site Domain denhaag.nl
Base Domain denhaag.nl
Scan Status Ok
Last Scan 2026-02-19T03:59:30+00:00
Next Scan 2026-03-21T03:59:30+00:00

Last Scan

Scanned 2026-02-19T03:59:30+00:00
URL https://denhaag.nl/robots.txt
Redirect https://www.denhaag.nl/nl/?robots=1
Redirect Domain www.denhaag.nl
Redirect Base denhaag.nl
Domain IPs 2.16.6.5
Redirect IPs 2a02:26f0:9c00::5c7b:674b, 2a02:26f0:9c00::5c7b:6760, 95.100.86.203, 95.100.86.218
Response IP 23.53.1.38
Found Yes
Hash 7c5ced73f3d711dc5b391072176ebc5f9ff1744f1678d7b6a5e28a77d4fc2e1e
SimHash c06dc9c44113

Groups

*

Rule Path
Disallow /nl/wp-admin/
Allow /nl/wp-admin/admin-ajax.php

*

Rule Path
Disallow /*.pdf$

*

Rule Path
Disallow
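
The three groups above all target the same user agent (`*`), so a crawler following RFC 9309 would treat their rules as one combined set. The sketch below shows how those rules might be evaluated with longest-match precedence, where `*` matches any character sequence and a trailing `$` anchors the end of the path. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical helpers written for this illustration and are not part of the scan output or of any library.

```python
import re

# Rules for user-agent "*" as listed in the groups above, merged into one list.
RULES = [
    ("disallow", "/nl/wp-admin/"),
    ("allow", "/nl/wp-admin/admin-ajax.php"),
    ("disallow", "/*.pdf$"),
    ("disallow", ""),  # empty path: matches nothing, so it blocks nothing
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into an anchored regex."""
    anchored = pattern.endswith("$")
    body = pattern[:-1] if anchored else pattern
    # "*" becomes ".*"; every other character is matched literally.
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile("^" + regex + ("$" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Return True if `path` may be crawled under longest-match precedence."""
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for kind, pattern in rules:
        if not pattern:
            continue  # skip empty rules
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            # The longest pattern wins; on a tie, allow beats disallow.
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed


if __name__ == "__main__":
    for sample in ("/nl/wp-admin/", "/nl/wp-admin/admin-ajax.php",
                   "/docs/report.pdf", "/nl/"):
        print(sample, is_allowed(sample))
```

Under these assumptions, `/nl/wp-admin/admin-ajax.php` stays crawlable because the Allow pattern is longer than the Disallow pattern that also matches it, while any path ending in `.pdf` is blocked. Note that Python's standard `urllib.robotparser` does not interpret `*` or `$` in paths, which is why the sketch uses its own matcher.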

Other Records

Field Value
sitemap https://www.denhaag.nl/nl/sitemap_index.xml
sitemap https://www.denhaag.nl/en/sitemap_index.xml
sitemap https://www.denhaag.nl/pers/sitemap_index.xml
sitemap https://www.denhaag.nl/cybersecurity/sitemap_index.xml
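
As a rough illustration of how a client could recover these sitemap records, the sketch below feeds a reconstructed set of lines to Python's standard `urllib.robotparser` and reads them back with `site_maps()` (available since Python 3.8). The `ROBOTS_LINES` list is an assumption: it reuses the four sitemap URLs listed above, but the exact layout of the original file is not shown in this scan.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction: only the sitemap URLs are taken from the scan.
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow:",
    "Sitemap: https://www.denhaag.nl/nl/sitemap_index.xml",
    "Sitemap: https://www.denhaag.nl/en/sitemap_index.xml",
    "Sitemap: https://www.denhaag.nl/pers/sitemap_index.xml",
    "Sitemap: https://www.denhaag.nl/cybersecurity/sitemap_index.xml",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)

# site_maps() returns the Sitemap values in file order, or None if there are none.
sitemaps = parser.site_maps() or []

if __name__ == "__main__":
    for url in sitemaps:
        print(url)
```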

Comments

  • GDH-1800 - PDF crawling blokkade (English: PDF crawling block)
  • START YOAST BLOCK
  • ---------------------------
  • ---------------------------
  • END YOAST BLOCK