rhein-zeitung.de
robots.txt

Robots Exclusion Standard data for rhein-zeitung.de

Resource Scan

Scan Details

Site Domain rhein-zeitung.de
Base Domain rhein-zeitung.de
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2024-10-05T21:07:41+00:00
Next Scan 2025-01-03T21:07:41+00:00

Last Successful Scan

Scanned2021-10-17T06:17:00+00:00
URL https://rhein-zeitung.de/robots.txt
Redirect https://www.rhein-zeitung.de/robots.txt
Redirect Domain www.rhein-zeitung.de
Redirect Base rhein-zeitung.de
Found Yes
Hash 0ec6b77d46c1ee0d2855e68b894d9426bc5f4b801f8e1c1058af6209a2ada7cc
SimHash c844f00469d7

Groups

*

Rule Path
Disallow /cms_addon
Disallow /cms_docs
Disallow /cms_media/module_ob
Disallow /tools
Disallow /redFACT
Disallow /REST/frontend/itemstatistics
Disallow /REST/frontend/download
Disallow /pu_rz/ajax
Disallow /pu_rz/snippets
Disallow /pu_rz/assets

Other Records

Field Value
sitemap https://www.rhein-zeitung.de/sitemap-index/1-Google_Sitemap_All.xml
sitemap https://www.rhein-zeitung.de/sitemap-index/2-News_Sitema.xml
sitemap https://www.rhein-zeitung.de/sitemap-index/83-Google_Sitemap_Dossiers.xml

Comments

  • global live settings :