lz.de
robots.txt

Robots Exclusion Standard data for lz.de

Resource Scan

Scan Details

Site Domain lz.de
Base Domain lz.de
Scan Status Ok
Last Scan2024-06-05T00:58:03+00:00
Next Scan 2024-06-12T00:58:03+00:00

Last Scan

Scanned2024-06-05T00:58:03+00:00
URL https://lz.de/robots.txt
Redirect https://www.lz.de/robots.txt
Redirect Domain www.lz.de
Redirect Base lz.de
Domain IPs 77.235.162.236
Redirect IPs 77.235.162.236
Response IP 77.235.162.236
Found Yes
Hash fc98b69720d2c1490981b6818f832e072fbbfb74218ad853b2540319c451837b
SimHash 70d450564921

Groups

ccbot
gptbot
chatgpt-user
google-extended

Rule Path
Disallow /

*

Rule Path
Disallow /tagsuche
Disallow /_em_cms/
Disallow /cms7/
Disallow /frage/
Disallow /microsites/
Disallow /suche
Disallow /profil/
Allow /_em_cms/globals/csslibs.php
Allow /_em_cms/globals/jslibs.php

Other Records

Field Value
sitemap https://www.lz.de/sitemap_lz_index.xml.gz
sitemap https://www.lz.de/sitemap_lz_index_news.xml.gz
sitemap https://www.lz.de/sitemap_lz_index_media.xml.gz