thelocal.com
robots.txt

Robots Exclusion Standard data for thelocal.com

Resource Scan

Scan Details

Site Domain thelocal.com
Base Domain thelocal.com
Scan Status Ok
Last Scan 2024-09-26T10:21:37+00:00
Next Scan 2024-10-03T10:21:37+00:00

Last Scan

Scanned 2024-09-26T10:21:37+00:00
URL https://thelocal.com/robots.txt
Redirect https://www.thelocal.com/robots.txt
Redirect Domain www.thelocal.com
Redirect Base thelocal.com
Domain IPs 104.18.6.91, 104.18.7.91, 2606:4700::6812:65b, 2606:4700::6812:75b
Redirect IPs 104.18.6.91, 104.18.7.91, 2606:4700::6812:65b, 2606:4700::6812:75b
Response IP 104.18.7.91
Found Yes
Hash 05256e47f5f77cf99d04da82c48f3ef1a033cb569d78b12cb235f5831a469641
SimHash 5d094050cf80

Groups

*

Rule Path
Disallow *.js*
Disallow *.css*
Disallow /200*
Disallow /2010*
Disallow /2011*
Disallow /2012*
Disallow /2013*
Disallow /amp*
Disallow /fonts*
Disallow /index.php/
Disallow *cms.*
Disallow *medium.*
Disallow *medium2.*
Disallow *discuss.*
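
The wildcard rules in the `*` group can be checked against a URL path with a small matcher. The sketch below assumes the common robots.txt convention that `*` matches any run of characters, a trailing `$` anchors the end of the path, and every other rule is a prefix match. The names `rule_to_regex` and `is_disallowed` are hypothetical helpers, not part of the scan or of any library.

```python
import re

# Rules copied from the "*" group of the scan above.
STAR_GROUP_RULES = [
    "*.js*", "*.css*", "/200*", "/2010*", "/2011*", "/2012*", "/2013*",
    "/amp*", "/fonts*", "/index.php/", "*cms.*", "*medium.*",
    "*medium2.*", "*discuss.*",
]


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate one Disallow pattern into an anchored regular expression.

    Assumption: "*" means "any sequence of characters" and a trailing "$"
    means "end of path"; everything else is matched literally as a prefix.
    """
    anchored = rule.endswith("$")
    if anchored:
        rule = rule[:-1]
    # Escape the literal pieces and rejoin them with ".*" for each wildcard.
    body = ".*".join(re.escape(part) for part in rule.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_disallowed(path: str, rules=STAR_GROUP_RULES) -> bool:
    """Return True if any rule in the group matches the given URL path."""
    return any(rule_to_regex(rule).match(path) is not None for rule in rules)


if __name__ == "__main__":
    for path in ["/static/app.js?v=2", "/2009/story", "/feed.json", "/news/article"]:
        print(path, is_disallowed(path))
```

Under these assumptions `*.js*` also matches paths such as `/feed.json`, because the literal `.js` appears inside `.json`.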

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

arquivo-web-crawler

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

awariorssbot
awariosmartbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

coccocbot-web

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

jenkersbot

Rule Path
Disallow /

mojeek

Rule Path
Disallow /

moodlebot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

newsnow

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

peer39_crawler
peer39_crawler/1.0

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

seekr

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

youbot

Rule Path
Disallow /

yacy

Rule Path
Disallow /

yandex

Rule Path
Disallow /
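
To see how the per-crawler groups above behave, the following sketch feeds a small robots.txt excerpt to Python's standard `urllib.robotparser`. The excerpt is reconstructed from the scan rather than copied from the live file, so its exact wording is an assumption. Only the non-wildcard `/index.php/` rule from the `*` group is included, because `urllib.robotparser` does plain prefix matching and does not interpret `*` inside paths.

```python
from urllib.robotparser import RobotFileParser

# Assumed excerpt, rebuilt from three of the groups listed in the scan.
ROBOTS_EXCERPT = """\
User-agent: *
Disallow: /index.php/

User-agent: gptbot
Disallow: /

User-agent: claudebot
Disallow: /
"""


def build_parser(text: str = ROBOTS_EXCERPT) -> RobotFileParser:
    """Parse robots.txt text held in memory (no network request is made)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


if __name__ == "__main__":
    parser = build_parser()
    base = "https://www.thelocal.com"
    # Crawlers with their own "Disallow: /" group are blocked everywhere.
    print("GPTBot    /news       ->", parser.can_fetch("GPTBot", base + "/news"))
    # Other crawlers fall back to the "*" group.
    print("Googlebot /news       ->", parser.can_fetch("Googlebot", base + "/news"))
    print("Googlebot /index.php/ ->", parser.can_fetch("Googlebot", base + "/index.php/a"))
```

User-agent names are compared case-insensitively by the parser, which is why `GPTBot` matches the lowercase `gptbot` group shown in the scan.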

Other Records

Field Value
sitemap https://www.thelocal.com/sitemap/com/news.xml
sitemap https://www.thelocal.com/sitemap/index.xml
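
The sitemap records can be read back with the same standard-library parser. This is a minimal sketch that assumes the two records appear in the file as ordinary `Sitemap:` lines, and it relies on `RobotFileParser.site_maps()`, which is available from Python 3.8.

```python
from urllib.robotparser import RobotFileParser

# Assumed form of the two "Other Records" entries in the original file.
SITEMAP_LINES = """\
Sitemap: https://www.thelocal.com/sitemap/com/news.xml
Sitemap: https://www.thelocal.com/sitemap/index.xml
"""


def list_sitemaps(text: str = SITEMAP_LINES):
    """Return the sitemap URLs declared in robots.txt text, or None if absent."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser.site_maps()


if __name__ == "__main__":
    for url in list_sitemaps() or []:
        print(url)
```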