englishforum.ch
robots.txt

Robots Exclusion Standard data for englishforum.ch

Resource Scan

Scan Details

Site Domain englishforum.ch
Base Domain englishforum.ch
Scan Status Ok
Last Scan2024-11-12T16:53:46+00:00
Next Scan 2024-11-19T16:53:46+00:00

Last Scan

Scanned2024-11-12T16:53:46+00:00
URL http://englishforum.ch/robots.txt
Redirect https://www.thelocal.ch/robots.txt
Redirect Domain www.thelocal.ch
Redirect Base thelocal.ch
Domain IPs 34.89.169.66
Redirect IPs 104.18.6.208, 104.18.7.208, 2606:4700::6812:6d0, 2606:4700::6812:7d0
Response IP 104.18.7.208
Found Yes
Hash b872f924151a9babe3f333ea36cb3ef7c98136aaa6789867bc52c68ada15710f
SimHash 7d1150508e82

Groups

*

Rule Path
Disallow *cms.*
Disallow *medium.*
Disallow *medium2.*
Disallow *discuss.*

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

arquivo-web-crawler

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

awariorssbot
awariosmartbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

coccocbot-web

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

claude-web

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

facebookbot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

jenkersbot

Rule Path
Disallow /

mojeek

Rule Path
Disallow /

moodlebot

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

newsnow

Rule Path
Disallow /

omgili

Rule Path
Disallow /

omgilibot

Rule Path
Disallow /

peer39_crawler
peer39_crawler/1.0

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

seekr

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

youbot

Rule Path
Disallow /

yacy

Rule Path
Disallow /

yandex

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.thelocal.at/sitemap/at/news.xml
sitemap https://www.thelocal.at/sitemap/index.xml