labasemarseille.org
robots.txt

Robots Exclusion Standard data for labasemarseille.org

Resource Scan

Scan Details

Site Domain labasemarseille.org
Base Domain labasemarseille.org
Scan Status Ok
Last Scan2024-06-29T16:46:55+00:00
Next Scan 2024-07-13T16:46:55+00:00

Last Scan

Scanned2024-06-29T16:46:55+00:00
URL https://labasemarseille.org/robots.txt
Domain IPs 89.234.177.160
Response IP 89.234.177.160
Found Yes
Hash 29d032d0186e63d78d073b3de0a355aa44660a9814c9e394aa9cd8446aa11700
SimHash 0036f73b588b

Groups

mj12bot

Rule Path
Disallow /

googlebot-image

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

baiduspider/2.0

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

ia_archiver-web.archive.org

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

telegrambot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

discordbot

Rule Path
Disallow /

applebot

Rule Path
Disallow /

ltx71

Rule Path
Disallow /

phantomjs

Rule Path
Disallow /

dalvik

Rule Path
Disallow /

cfnetwork

Rule Path
Disallow /

favicon

Rule Path
Disallow /

qwantify

Rule Path
Disallow /

python

Rule Path
Disallow /

exabot

Rule Path
Disallow /

dataprovider.com

Rule Path
Disallow /

applebot

Rule Path
Disallow /

python-urllib

Rule Path
Disallow /

java

Rule Path
Disallow /

libwww-perl

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

baidu

Rule Path
Disallow /

dataprovider.com

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

gowikibot

Rule Path
Disallow /

crawl

Rule Path
Disallow /

dalvik

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

googlebot-image

Rule Path
Disallow /

Other Records

Field Value
sitemap https://labasemarseille.org/sitemap.xml

Comments

  • User-agent: bingbot
  • Disallow: /