zeit.de
robots.txt
Robots Exclusion Standard data for zeit.de
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | zeit.de |
Base Domain | zeit.de |
Scan Status | Ok |
Last Scan | 2024-05-02T19:24:29+00:00 |
Next Scan | 2024-05-09T19:24:29+00:00 |
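The two timestamps above are exactly seven days apart. A minimal sketch of that arithmetic follows, assuming the scanner works on a fixed weekly cadence (the page does not state this; `SCAN_INTERVAL` is a hypothetical name):

```python
from datetime import datetime, timedelta

# Timestamp copied from the "Last Scan" row above.
last_scan = datetime.fromisoformat("2024-05-02T19:24:29+00:00")

# Assumption: a fixed 7-day rescan interval, inferred only from the
# gap between the "Last Scan" and "Next Scan" values.
SCAN_INTERVAL = timedelta(days=7)

next_scan = last_scan + SCAN_INTERVAL
print(next_scan.isoformat())
```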
Last Scan
Field | Value |
---|---|
Scanned | 2024-05-02T19:24:29+00:00 |
URL | https://zeit.de/robots.txt |
Redirect | https://www.zeit.de/robots.txt |
Redirect Domain | www.zeit.de |
Redirect Base | zeit.de |
Domain IPs | 35.246.185.66 |
Redirect IPs | 151.101.130.49, 151.101.194.49, 151.101.2.49, 151.101.66.49 |
Response IP | 199.232.46.49 |
Found | Yes |
Hash | b206c99539d2a1f8886300fa5c131a7422143022cce8508100edce0af5c24c7d |
SimHash | 2224445c4d8f |
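The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the page does not name the algorithm or say exactly which bytes were hashed. Under that assumption, a change check against a freshly fetched file might be sketched as follows (`content_hash` and `is_unchanged` are hypothetical helper names):

```python
import hashlib

# Value copied from the "Hash" row above.
RECORDED_HASH = "b206c99539d2a1f8886300fa5c131a7422143022cce8508100edce0af5c24c7d"


def content_hash(body: bytes) -> str:
    """Return the SHA-256 hex digest of a raw robots.txt body.

    Assumption: the scanner hashes the unmodified response bytes.
    """
    return hashlib.sha256(body).hexdigest()


def is_unchanged(body: bytes) -> bool:
    """Compare a fetched body against the digest recorded by the scan."""
    return content_hash(body) == RECORDED_HASH
```

The body could be obtained with `urllib.request.urlopen("https://www.zeit.de/robots.txt").read()`. A mismatch would mean either that the file has changed since the scan or that the hashing assumption above does not hold.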
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /zeit/ |
Disallow | /templates/ |
Disallow | /hp_channels/ |
Disallow | /send/ |
Disallow | /suche/ |
Disallow | /rezepte/suche/ |
Disallow | */comment-thread? |
Disallow | */liveblog-backend* |
Disallow | /framebuilder/ |
Disallow | /campus/framebuilder/ |
Disallow | /cre-1.0/tracking/*.js$ |
Disallow | /navigation-teasers* |
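Several of these paths use the `*` wildcard and the `$` end-of-path anchor defined by the Robots Exclusion Protocol. Python's standard `urllib.robotparser` treats rule paths as plain prefixes, so the sketch below translates each rule to a regular expression instead. It is a simplified illustration, not a complete parser: it skips percent-encoding normalization, and because this group contains only Disallow rules it needs no Allow/Disallow precedence logic. The names `rule_to_regex` and `is_disallowed` are hypothetical.

```python
import re

# Disallow paths copied from the table above.
DISALLOW = [
    "/zeit/", "/templates/", "/hp_channels/", "/send/", "/suche/",
    "/rezepte/suche/", "*/comment-thread?", "*/liveblog-backend*",
    "/framebuilder/", "/campus/framebuilder/",
    "/cre-1.0/tracking/*.js$", "/navigation-teasers*",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one rule path into a regex matched from the path start."""
    anchored = rule.endswith("$")  # a trailing "$" pins the match to the end
    if anchored:
        rule = rule[:-1]
    # "*" matches any run of characters; everything else is literal,
    # including "?" which has no special meaning in robots.txt.
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile(body + ("$" if anchored else ""))


PATTERNS = [rule_to_regex(rule) for rule in DISALLOW]


def is_disallowed(path: str) -> bool:
    """Return True if any Disallow rule matches the URL path (with query)."""
    return any(pattern.match(path) for pattern in PATTERNS)
```

For example, `is_disallowed("/suche/index")` is true because of the `/suche/` prefix rule, while `is_disallowed("/cre-1.0/tracking/app.js?v=1")` is false because the `$` anchor requires the path to end in `.js`.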
Other Records
Field | Value |
---|---|
sitemap | https://www.zeit.de/gsitemaps/index.xml |
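The group and sitemap records above correspond to lines in robots.txt syntax. The sketch below renders records of this shape back into that syntax; `render_robots_txt` is a hypothetical helper, and only the first two Disallow rules are used to keep the example short. The output is not the original file: comments, blank lines, and ordering in the real file are unknown, so its digest would not be expected to match the recorded Hash.

```python
def render_robots_txt(groups: dict, sitemaps: list) -> str:
    """Render parsed groups and sitemap URLs as robots.txt text.

    groups maps a user-agent token to a list of (field, path) pairs.
    """
    lines = []
    for agent, rules in groups.items():
        lines.append(f"User-agent: {agent}")
        lines.extend(f"{field}: {path}" for field, path in rules)
        lines.append("")  # blank line closes the group
    lines.extend(f"Sitemap: {url}" for url in sitemaps)
    return "\n".join(lines) + "\n"


# Subset of the records listed above.
example = render_robots_txt(
    {"*": [("Disallow", "/zeit/"), ("Disallow", "/templates/")]},
    ["https://www.zeit.de/gsitemaps/index.xml"],
)
print(example)
```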
Comments