cleasite.fr
robots.txt

Robots Exclusion Standard data for cleasite.fr

Resource Scan

Scan Details

Site Domain cleasite.fr
Base Domain cleasite.fr
Scan Status Ok
Last Scan2024-08-29T12:21:09+00:00
Next Scan 2024-09-28T12:21:09+00:00

Last Scan

Scanned2024-08-29T12:21:09+00:00
URL https://www.cleasite.fr/robots.txt
Domain IPs 20.199.122.254
Response IP 20.199.122.254
Found Yes
Hash 0b39fc14e7376569ca4cdbe50328b2569f23400de22def1b629ca00359de93a2
SimHash ca1cc5c24693

Groups

*

Rule Path
Allow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

haosouspider

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sogou inst spider

Rule Path
Disallow /

sogou spider2

Rule Path
Disallow /

nutch

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

seokicks

Rule Path
Disallow /

obot

Rule Path
Disallow /

mojeekbot

Rule Path
Disallow /

seekport crawler

Rule Path
Disallow /

yacybot

Rule Path
Disallow /

yeti

Rule Path
Disallow /

ucbrowser

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

tinytestbot

Rule Path
Disallow /

the knowledge ai

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

megaindex.ru/2.0

Rule Path
Disallow /

mqqbrowser

Rule Path
Disallow /

nimbostratus-bot/v1.3.2

Rule Path
Disallow /

liebaofast

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

sputnikbot/2.3

Rule Path
Disallow /

Comments

  • Allow Robots
  • Disallow Robots