mediahuis.com
robots.txt

Robots Exclusion Standard data for mediahuis.com

Resource Scan

Scan Details

Site Domain mediahuis.com
Base Domain mediahuis.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-10-27T14:52:34+00:00
Next Scan 2025-01-25T14:52:34+00:00
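
As a quick check on the two timestamps above, the gap between the last scan and the next scheduled scan can be computed directly. This is only a sketch: the variable names are illustrative, and the report itself does not say how the scanner picks its rescan schedule.

```python
from datetime import datetime

# Timestamps copied from the Scan Details fields above.
last_scan = datetime.fromisoformat("2024-10-27T14:52:34+00:00")
next_scan = datetime.fromisoformat("2025-01-25T14:52:34+00:00")

# Subtracting two timezone-aware datetimes yields a timedelta.
interval = next_scan - last_scan
print(interval.days)  # 90
```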

Last Successful Scan

Scanned 2023-03-15T10:47:46+00:00
URL https://mediahuis.com/robots.txt
Domain IPs 104.18.28.76, 104.18.29.76, 2606:4700::6812:1c4c, 2606:4700::6812:1d4c
Response IP 104.18.28.76
Found Yes
Hash 6e5fde8ee6b3127bc8496ba348c094b8218ba110491470d7ed2a97b61a23a78e
SimHash 6b109c406db0

Groups

*

Rule Path
Disallow /wp-admin/
Allow /wp-admin/admin-ajax.php
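
The two rules above interact: the Allow path is longer and more specific than the Disallow path, so crawlers that apply longest-match precedence (the behaviour described in RFC 9309) may still fetch admin-ajax.php while the rest of /wp-admin/ stays blocked. The following is a minimal sketch of that precedence using plain prefix matching only. The function name `is_allowed` and the `RULES` list are illustrative, not part of the scanned file, and wildcard (`*`, `$`) handling is deliberately left out.

```python
# Rules for the "*" group, as listed in the scan above.
RULES = [
    ("disallow", "/wp-admin/"),
    ("allow", "/wp-admin/admin-ajax.php"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence.

    Simplified: plain prefix matching, no wildcard support.
    """
    best = None  # (directive, prefix) of the most specific matching rule
    for directive, prefix in rules:
        if not prefix or not path.startswith(prefix):
            continue
        longer = best is None or len(prefix) > len(best[1])
        # On equal length, prefer the allow rule.
        tie_allow = (
            best is not None
            and len(prefix) == len(best[1])
            and directive == "allow"
        )
        if longer or tie_allow:
            best = (directive, prefix)
    # A path that matches no rule is allowed by default.
    return best is None or best[0] == "allow"


print(is_allowed("/wp-admin/admin-ajax.php"))  # True
print(is_allowed("/wp-admin/options.php"))     # False
print(is_allowed("/about/"))                   # True
```

Parsers that apply rules in file order rather than by longest match can give a different answer for admin-ajax.php, because the Disallow line appears first.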

Other Records

Field Value
sitemap https://www.mediahuis.com/sitemap_index.xml

Comments

  • XML Sitemaps

Warnings

  • 1 invalid line.