tz-online.de
robots.txt

Robots Exclusion Standard data for tz-online.de

Resource Scan

Scan Details

Site Domain	tz-online.de
Base Domain	tz-online.de
Scan Status	Ok
Last Scan	2024-09-29T22:10:38+00:00
Next Scan	2024-10-06T22:10:38+00:00

Last Scan

Scanned	2024-09-29T22:10:38+00:00
URL	https://tz-online.de/robots.txt
Domain IPs	91.234.213.50
Response IP	91.234.213.50
Found	Yes
Hash	7859fb1170e85c3ced702197a5bad99fd57b8e2f208e3c36e0e0c5e1b19e4fa8
SimHash	63011b58a725

Groups

*

Rule	Path
Disallow	/lightweight-ajax
Disallow	/*?trafficsource
Disallow	/suche/
Disallow	/*?cmp=defrss
Disallow	/test/
Disallow	/fdn/bootstrap/
Disallow	/bi/bootstrap/
Disallow	/bi/doop/
Disallow	/sso/
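
Two of the rules above (/*?trafficsource and /*?cmp=defrss) use the * wildcard. The following is a minimal sketch of how such a pattern could be matched against a URL path, assuming the common convention that * matches any character sequence and a trailing $ anchors the end of the path. The function name rule_matches and the sample paths are hypothetical and are not taken from the scan.

```python
import re


def rule_matches(pattern: str, path: str) -> bool:
    """Return True if a robots.txt path pattern matches the given URL path.

    Assumes '*' matches any sequence of characters and a trailing '$'
    anchors the end of the path; everything else is a literal prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal parts and join them with '.*' for each wildcard.
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += "$"
    # re.match anchors at the start of the path, giving prefix semantics.
    return re.match(regex, path) is not None


# Hypothetical paths, used only for illustration.
print(rule_matches("/*?trafficsource", "/sport/artikel.html?trafficsource=x"))
print(rule_matches("/suche/", "/suche/?q=wetter"))
print(rule_matches("/*?cmp=defrss", "/sport/"))
```

Note that Python's standard urllib.robotparser does plain prefix matching and does not expand wildcards, which is why the sketch uses a regular expression instead.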

xovi

Rule	Path
Disallow	/

sistrix

Rule	Path
Disallow	/

searchmetricsbot

Rule	Path
Disallow	/

bingbot

Rule	Path
Disallow	/test/

gptbot

Rule	Path
Allow	/ueber-uns/
Disallow	/
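
The gptbot group pairs an Allow for /ueber-uns/ with a Disallow for everything else. As a rough illustration of the effect, the sketch below feeds a reconstruction of that group to Python's standard urllib.robotparser. The exact casing and line order of the original file are assumptions, and the path /politik/ is a hypothetical example, not something listed in the scan.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the gptbot group above; the original file's exact
# casing and line order are assumed, not confirmed by the scan.
ROBOTS_LINES = [
    "User-agent: gptbot",
    "Allow: /ueber-uns/",
    "Disallow: /",
]


def gptbot_can_fetch(path: str) -> bool:
    """Check whether the reconstructed group lets GPTBot fetch a path."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_LINES)
    return parser.can_fetch("GPTBot", "https://tz-online.de" + path)


print(gptbot_can_fetch("/ueber-uns/"))  # allowed by the Allow rule
print(gptbot_can_fetch("/politik/"))    # hypothetical path, caught by Disallow: /
```

urllib.robotparser applies the first matching rule in file order, whereas some crawlers pick the longest matching path. For this group both approaches give the same answer.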

ccbot

Rule	Path
Allow	/ueber-uns/
Disallow	/

msnbot

Rule	Path
Disallow	/test/

Other Records

Field	Value
crawl-delay	5
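
A crawler that honours this record would wait between requests. The sketch below reads the value with urllib.robotparser, assuming the crawl-delay record belongs to the msnbot group listed directly above it and that the unit is seconds, as is conventional. The reconstructed lines are an approximation of the original file.

```python
from urllib.robotparser import RobotFileParser

# Assumption: the crawl-delay record is part of the msnbot group, since
# the scan lists it immediately after that group's rules.
ROBOTS_LINES = [
    "User-agent: msnbot",
    "Disallow: /test/",
    "Crawl-delay: 5",
]


def msnbot_crawl_delay():
    """Return the crawl delay the reconstructed group gives to msnbot."""
    parser = RobotFileParser()
    parser.parse(ROBOTS_LINES)
    return parser.crawl_delay("msnbot")


print(msnbot_crawl_delay())
```

crawl-delay is a widely used extension rather than part of the core standard, so not every crawler respects it.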

amazonbot
anthropic-ai
applebot-extended
awariorssbot
awariosmartbot
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
dataforseobot
facebookbot
google-extended
imagesiftbot
magpie-crawler
omgili
omgilibot
peer39_crawler
peer39_crawler/1.0
perplexitybot
youbot

Rule	Path
Disallow	/

Other Records

Field	Value
sitemap	https://www.tz.de/news.xml

Comments

robots.txt www.tz.de
Legal notice: www.tz.de expressly reserves the right to use its content for commercial text and data mining (§ 44b UrhG).
The use of robots or other automated means to access www.tz.de or collect or mine data without the express permission of www.tz.de is strictly prohibited.
