archief.parool.nl
robots.txt
Robots Exclusion Standard data for archief.parool.nl
Resource Scan
Scan Details
Site Domain | archief.parool.nl |
Base Domain | parool.nl |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Request timed out. |
Last Scan | 2024-04-04T05:00:07+00:00 |
Next Scan | 2024-07-03T05:00:07+00:00 |
Last Successful Scan
Scanned | 2023-09-05T22:24:12+00:00 |
URL | http://archief.parool.nl/robots.txt |
Redirect | https://www.parool.nl/robots.txt |
Redirect Domain | www.parool.nl |
Redirect Base | parool.nl |
Domain IPs | 146.185.53.23 |
Redirect IPs | 2600:1413:b000:6::17d5:2bc4, 2600:1413:b000:6::17d5:2bd1, 96.17.96.13, 96.17.96.4 |
Response IP | 23.44.4.161 |
Found | Yes |
Hash | 5d21568a3eb1e77c736c48ae9219432293a6d79494308ee4d8c1f6478a3baa98 |
SimHash | 69500b595f75 |
Groups
*
Rule | Path |
---|---|
Allow | / |
Disallow | /*?otag* |
Disallow | /*?abo_type* |
Disallow | /*?URL_referrer* |
Disallow | /auth/ |
Disallow | /temptation/ |
Disallow | /*utm_campaign%3Dshared_earned* |
Disallow | /*redirectUri%3D* |
Disallow | /zoeken?query=* |
Disallow | /search?query=* |
Other Records
Field | Value |
---|---|
sitemap | https://www.parool.nl/sitemap.xml |
Comments