thelocal.it
robots.txt

Robots Exclusion Standard data for thelocal.it

Resource Scan

Scan Details

Site Domain thelocal.it
Base Domain thelocal.it
Scan Status Ok
Last Scan2024-09-26T16:56:10+00:00
Next Scan 2024-10-03T16:56:10+00:00

Last Scan

Scanned2024-09-26T16:56:10+00:00
URL https://thelocal.it/robots.txt
Redirect https://www.thelocal.it/robots.txt
Redirect Domain www.thelocal.it
Redirect Base thelocal.it
Domain IPs 104.18.4.188, 104.18.5.188, 2606:4700::6812:4bc, 2606:4700::6812:5bc
Redirect IPs 104.18.4.188, 104.18.5.188, 2606:4700::6812:4bc, 2606:4700::6812:5bc
Response IP 104.18.5.188
Found Yes
Hash f1aac0f33036f000f8c5bda55b14be5eb065d9521f023dc04e303c41132e051a
SimHash 5d2950d18f80

Groups

*

Rule Path
Disallow *.js*
Disallow *.css*
Disallow /200*
Disallow /2010*
Disallow /2011*
Disallow /2012*
Disallow /2013*
Disallow /amp*
Disallow /fonts*
Disallow /index.php/
Disallow *cms.*
Disallow *medium.*
Disallow *medium2.*
Disallow *discuss.*

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

arquivo-web-crawler

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

awariorssbot
awariosmartbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

coccocbot-web

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

jenkersbot

Rule Path
Disallow /

mojeek

Rule Path
Disallow /

moodlebot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

newsnow

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

peer39_crawler
peer39_crawler/1.0

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

seekr

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

youbot

Rule Path
Disallow /

yacy

Rule Path
Disallow /

yandex

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.thelocal.it/sitemap/it/news.xml
sitemap https://www.thelocal.it/sitemap/index.xml