tagblatt.de
robots.txt

Robots Exclusion Standard data for tagblatt.de

Resource Scan

Site Domain	tagblatt.de
Base Domain	tagblatt.de
Scan Status	Failed
Failure Stage	Fetching resource.
Failure Reason	Server returned a client error.
Last Scan	2024-10-06T17:56:39+00:00
Next Scan	2024-10-13T17:56:39+00:00

Scanned	2024-09-28T17:56:24+00:00
URL	https://www.tagblatt.de/robots.txt
Domain IPs	217.182.184.195
Response IP	217.182.184.195
Found	Yes
Hash	a2e66160ba0e05e8f61e1074b45e4e53b31033e08310e39a5ca2ca5abb559c4d
SimHash	51495d50c734

Rule

Path

Disallow

/User

Disallow

/Dateien

Disallow

/Nachrichten/Suche

Disallow

/ScriptResource

Disallow

/WebResource

Disallow

/Verlag/Datenschutz

Disallow

/Marktplatz

Disallow

/Verlag/OAA-gesperrt

Field	Value
crawl-delay	2

Field

Value

crawl-delay

2

Back to top

Back to top