papendrechtsnieuwsblad.nl
robots.txt
Robots Exclusion Standard data for papendrechtsnieuwsblad.nl
Resource Scan
Scan Details
Site Domain | papendrechtsnieuwsblad.nl |
Base Domain | papendrechtsnieuwsblad.nl |
Scan Status | Ok |
Last Scan | 2024-08-29T13:50:46+00:00 |
Next Scan | 2024-09-28T13:50:46+00:00 |
Last Scan
Scanned | 2024-08-29T13:50:46+00:00 |
URL | https://papendrechtsnieuwsblad.nl/robots.txt |
Redirect | https://www.ad.nl/robots.txt |
Redirect Domain | www.ad.nl |
Redirect Base | ad.nl |
Domain IPs | 2600:1413:b000:d::6011:6011, 2600:1413:b000:d::6011:6016, 96.17.96.17, 96.17.96.22 |
Redirect IPs | 2600:1413:b000:d::6011:6009, 2600:1413:b000:d::6011:600e, 96.17.96.14, 96.17.96.9 |
Response IP | 23.50.232.233 |
Found | Yes |
Hash | 5ec67004223ba2d2ed5f769ee217457a1c68e006a642909819a0a9a2d13c055d |
SimHash | 29391b58cd75 |
Groups
*
Rule | Path |
---|---|
Disallow | /*webview |
Disallow | /auth |
Disallow | /*widget* |
Disallow | /*?*otag= |
Disallow | /*?*abo_type= |
Disallow | /*?*utm_source= |
Disallow | /*?*currentArticleId= |
Disallow | /*?*articleUrl= |
Disallow | /zoeken?query= |
Disallow | /inloggen?* |
Disallow | /login?* |
Disallow | *~ab9e5892* |
Disallow | *~af7ac112* |
Disallow | *~a2575106* |
Disallow | *?*redirect_url=* |
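
The rules above rely on the `*` wildcard, so a plain prefix comparison is not enough to evaluate them. The following is a minimal sketch of how a crawler might test a URL path against this group, assuming the common wildcard semantics described in RFC 9309 (`*` matches any run of characters, a trailing `$` anchors the end, and everything else is a prefix match). The helper names `rule_to_regex` and `is_disallowed` are hypothetical and not part of the scan. Because the group contains no `Allow` rules, the sketch skips longest-match precedence.

```python
import re

# Disallow paths copied from the "*" group in the table above.
DISALLOW = [
    "/*webview", "/auth", "/*widget*", "/*?*otag=", "/*?*abo_type=",
    "/*?*utm_source=", "/*?*currentArticleId=", "/*?*articleUrl=",
    "/zoeken?query=", "/inloggen?*", "/login?*",
    "*~ab9e5892*", "*~af7ac112*", "*~a2575106*", "*?*redirect_url=*",
]


def rule_to_regex(path: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a regex (hypothetical helper).

    Assumed semantics: '*' matches any sequence of characters, a trailing
    '$' anchors the end of the path, and all other characters are literal.
    """
    anchored = path.endswith("$")
    if anchored:
        path = path[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in path.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(url_path: str) -> bool:
    """Return True if any Disallow rule matches from the start of url_path."""
    return any(rule_to_regex(rule).match(url_path) for rule in DISALLOW)
```

Under these assumptions, `/auth` acts as a prefix, so `is_disallowed("/author/jan")` is true as well as `is_disallowed("/auth/login")`. In contrast, `/inloggen?*` requires a literal `?`, so the bare path `/inloggen` stays crawlable.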
Other Records
Field | Value |
---|---|
sitemap | https://www.ad.nl/sitemap.xml |
sitemap | https://www.ad.nl/sitemap-news.xml |
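
As a sketch of how the sitemap records could be read programmatically, the snippet below feeds a reconstructed fragment of the file to Python's standard `urllib.robotparser`. The `ROBOTS_TXT` text is an assumption rebuilt from the tables in this report, not the verbatim file. Note that `urllib.robotparser` does not implement `*` wildcard matching, so it is used here only to extract the `Sitemap` lines, not to evaluate the Disallow rules.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed fragment (assumption): one rule plus the two sitemap records.
ROBOTS_TXT = """\
User-agent: *
Disallow: /auth

Sitemap: https://www.ad.nl/sitemap.xml
Sitemap: https://www.ad.nl/sitemap-news.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# site_maps() (Python 3.8+) returns the Sitemap URLs in file order,
# or None when the file declares no sitemaps.
sitemaps = parser.site_maps() or []
```

Both sitemap URLs point at `www.ad.nl`, consistent with the redirect recorded in the scan details.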