tlz.de
robots.txt
Robots Exclusion Standard data for tlz.de
Resource Scan
Scan Details
Site Domain | tlz.de |
Base Domain | tlz.de |
Scan Status | Ok |
Last Scan | 2024-11-13T16:07:22+00:00 |
Next Scan | 2024-11-20T16:07:22+00:00 |
Last Scan
Scanned | 2024-11-13T16:07:22+00:00 |
URL | https://tlz.de/robots.txt |
Redirect | https://www.tlz.de:443/robots.txt |
Redirect Domain | www.tlz.de |
Redirect Base | tlz.de |
Domain IPs | 18.185.81.127, 18.196.221.37, 3.72.121.83 |
Redirect IPs | 108.157.254.123, 108.157.254.41, 108.157.254.64, 108.157.254.96, 2600:9000:2753:2600:10:4c3:df40:93a1, 2600:9000:2753:a00:10:4c3:df40:93a1, 2600:9000:2753:ac00:10:4c3:df40:93a1, 2600:9000:2753:be00:10:4c3:df40:93a1, 2600:9000:2753:c600:10:4c3:df40:93a1, 2600:9000:2753:cc00:10:4c3:df40:93a1, 2600:9000:2753:dc00:10:4c3:df40:93a1, 2600:9000:2753:e00:10:4c3:df40:93a1 |
Response IP | 108.157.254.41 |
Found | Yes |
Hash | c4e0265729420c47f570e705f9a29c611dc94290b15b79b0748159a4d7d4cfaa |
SimHash | 5c0b9052c621 |
Groups
*
Rule | Path |
---|---|
Allow | /static/*/client.js |
Allow | /static/*/main.css |
Allow | /static/*/favicon.png |
Disallow | /stats/* |
Disallow | /*?config* |
Disallow | /*.xmli* |
Disallow | /*?service=Ajax |
Disallow | /*?service=ajax |
Disallow | /config/* |
Disallow | /test/* |
Disallow | /Test/* |
Disallow | /template/* |
Disallow | /*?*token=* |
Disallow | /*?*eventId=* |
Disallow | /static/* |
Disallow | /migration_import_no_section/* |
Disallow | /secure/ |
Disallow | /socialmedia/* |
Disallow | *reader_id%3DREADER_ID* |
Disallow | /suche/* |
Disallow | /*?widgetid= |
Disallow | /newsletter-result/ |
Disallow | *tpcc%3D* |
Disallow | /resources/ |
Disallow | /bin/ |
Disallow | /downloads/ |
Disallow | /service/newsletter-adconsent |
Disallow | /pagespeed_static/ |
Disallow | /resources/img/*icon*pagespeed |
semrushbot-sa
ahrefsbot
backlinkcrawler
linkchecker
dataforseobot
deepcrawl
majestic
majestic12
mj12bot
onpagebot
optimizer
rytebot
semrushbot
semrushbot-si
seobility
seodiver
seokicks
seokicks-robot
sistrix
openindexspider
openindexspider
sistrix optimizer
sistrix
sistrix crawler
siteauditbot
Rule | Path |
---|---|
Disallow | / |
amazonbot
anthropic-ai
applebot-extended
archive.org_bot
bytespider
ccbot
chatgpt-user
claudebot
claude-web
cohere-ai
diffbot
facebookbot
friendlycrawler
google-extended
googleother
gptbot
ia_archiver
img2dataset
omgili
omgilibot
peer39_crawler
peer39_crawler/1.0
perplexitybot
youbot
meta-externalagent
imagesiftbot
Rule | Path |
---|---|
Disallow | / |
arquivo-web-crawler
arquivo.pt
barkrowler
blexbot
browsertrix
brozzler
builtwith
cincraw
coccocbot
contao/crawler
dmbot
domainstatsbot
dotbot
dotbot
fluid
haosouspider
happywing
harsilbot
hatena antenna
heritrix
imagesiftbot
kazbtbot
kraken
linkdebot
linkfluence yak bot
mail.ru_bot
metajobbot
monsidobot
netestate
ogdwctcxcrawler
petalbot
researchbot
riddler
sentibot
rogerbot
semanticbot
semanticscholarbot
sirdatabot
spbot
special_archiver
splitsignalbot
tag-crawler
testcrawler
thinkers-bot
toplistbot
uipbot/1.0
urlsuma
user-agent
vsusearchspider
weborama-fetcher
wiseguys robot
wpbot
yeti
Rule | Path |
---|---|
Disallow | / |
Other Records
Field | Value |
---|---|
sitemap | https://www.tlz.de/sitemaps/news.xml |
Comments