rhein-zeitung.de
robots.txt
Robots Exclusion Standard data for rhein-zeitung.de
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | rhein-zeitung.de |
Base Domain | rhein-zeitung.de |
Scan Status | Failed |
Failure Reason | Scan timed out. |
Last Scan | 2024-10-05T21:07:41+00:00 |
Next Scan | 2025-01-03T21:07:41+00:00 |
Last Successful Scan
Field | Value |
---|---|
Scanned | 2021-10-17T06:17:00+00:00 |
URL | https://rhein-zeitung.de/robots.txt |
Redirect | https://www.rhein-zeitung.de/robots.txt |
Redirect Domain | www.rhein-zeitung.de |
Redirect Base | rhein-zeitung.de |
Found | Yes |
Hash | 0ec6b77d46c1ee0d2855e68b894d9426bc5f4b801f8e1c1058af6209a2ada7cc |
SimHash | c844f00469d7 |
Groups
*
Rule | Path |
---|---|
Disallow | /cms_addon |
Disallow | /cms_docs |
Disallow | /cms_media/module_ob |
Disallow | /tools |
Disallow | /redFACT |
Disallow | /REST/frontend/itemstatistics |
Disallow | /REST/frontend/download |
Disallow | /pu_rz/ajax |
Disallow | /pu_rz/snippets |
Disallow | /pu_rz/assets |
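
To illustrate how a crawler might interpret the `*` group above, here is a minimal sketch using Python's standard `urllib.robotparser`. The `ROBOTS_TXT` string is reconstructed from the table, so the original file's exact text and ordering are assumptions, and `is_allowed` is a hypothetical helper name. This parser matches rules by simple path prefix; other crawlers may treat edge cases differently.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the "*" group in the table above (assumed layout,
# not a verbatim copy of the site's robots.txt).
ROBOTS_TXT = """\
User-agent: *
Disallow: /cms_addon
Disallow: /cms_docs
Disallow: /cms_media/module_ob
Disallow: /tools
Disallow: /redFACT
Disallow: /REST/frontend/itemstatistics
Disallow: /REST/frontend/download
Disallow: /pu_rz/ajax
Disallow: /pu_rz/snippets
Disallow: /pu_rz/assets
"""


def is_allowed(path: str, user_agent: str = "*") -> bool:
    """Return True if the reconstructed rules permit fetching `path`."""
    parser = RobotFileParser()
    # parse() accepts an iterable of lines, so no network access is needed.
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(user_agent, path)
```

With these rules, `is_allowed("/tools/report")` is `False` because the path starts with `/tools`, while a path that matches none of the listed prefixes, such as the hypothetical `/nachrichten`, is allowed. Matching is case-sensitive, so `/redfact` is not covered by the `/redFACT` rule.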
Other Records
Field | Value |
---|---|
sitemap | https://www.rhein-zeitung.de/sitemap-index/1-Google_Sitemap_All.xml |
sitemap | https://www.rhein-zeitung.de/sitemap-index/2-News_Sitema.xml |
sitemap | https://www.rhein-zeitung.de/sitemap-index/83-Google_Sitemap_Dossiers.xml |
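
The `sitemap` records can be read with the same standard-library parser. The sketch below assumes the records appear in the file as `Sitemap:` lines with the URLs exactly as listed above; `SITEMAP_LINES` and `list_sitemaps` are hypothetical names, and `RobotFileParser.site_maps()` requires Python 3.8 or later.

```python
from urllib.robotparser import RobotFileParser

# Sitemap records as listed in the table above, written as robots.txt lines
# (assumed formatting; URLs are copied unchanged).
SITEMAP_LINES = [
    "Sitemap: https://www.rhein-zeitung.de/sitemap-index/1-Google_Sitemap_All.xml",
    "Sitemap: https://www.rhein-zeitung.de/sitemap-index/2-News_Sitema.xml",
    "Sitemap: https://www.rhein-zeitung.de/sitemap-index/83-Google_Sitemap_Dossiers.xml",
]


def list_sitemaps(lines):
    """Return the sitemap URLs declared in the given robots.txt lines."""
    parser = RobotFileParser()
    parser.parse(lines)
    # site_maps() returns None when no sitemap records are present.
    return parser.site_maps() or []
```

Calling `list_sitemaps(SITEMAP_LINES)` returns the three URLs in file order; sitemap records are independent of any user-agent group, so no `User-agent` line is needed for them to be picked up.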
Comments