thelocal.de
robots.txt

Robots Exclusion Standard data for thelocal.de

Resource Scan

Scan Details

Site Domain thelocal.de
Base Domain thelocal.de
Scan Status Ok
Last Scan2024-11-03T17:25:17+00:00
Next Scan 2024-11-10T17:25:17+00:00

Last Scan

Scanned2024-11-03T17:25:17+00:00
URL https://thelocal.de/robots.txt
Redirect https://www.thelocal.de/robots.txt
Redirect Domain www.thelocal.de
Redirect Base thelocal.de
Domain IPs 104.18.8.180, 104.18.9.180, 2606:4700::6812:8b4, 2606:4700::6812:9b4
Redirect IPs 104.18.8.180, 104.18.9.180, 2606:4700::6812:8b4, 2606:4700::6812:9b4
Response IP 104.18.9.180
Found Yes
Hash 97db0976384078c5ef42acf2467d9ded6e7c7bd46ef5237c9e734833b9aa0439
SimHash 5d1950d08f80

Groups

*

Rule Path
Disallow *.js*
Disallow *.css*
Disallow /200*
Disallow /2010*
Disallow /2011*
Disallow /2012*
Disallow /2013*
Disallow /amp*
Disallow /fonts*
Disallow /index.php/
Disallow *cms.*
Disallow *medium.*
Disallow *medium2.*
Disallow *discuss.*

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

arquivo-web-crawler

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

awariorssbot
awariosmartbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

coccocbot-web

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

jenkersbot

Rule Path
Disallow /

mojeek

Rule Path
Disallow /

moodlebot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

newsnow

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

peer39_crawler
peer39_crawler/1.0

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

seekr

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

youbot

Rule Path
Disallow /

yacy

Rule Path
Disallow /

yandex

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.thelocal.de/sitemap/de/news.xml
sitemap https://www.thelocal.de/sitemap/index.xml