chaletdejardin.fr
robots.txt

Robots Exclusion Standard data for chaletdejardin.fr

Resource Scan

Scan Details

Site Domain chaletdejardin.fr
Base Domain chaletdejardin.fr
Scan Status Ok
Last Scan2024-09-24T06:57:05+00:00
Next Scan 2024-10-24T06:57:05+00:00

Last Scan

Scanned2024-09-24T06:57:05+00:00
URL https://chaletdejardin.fr/robots.txt
Redirect https://www.chaletdejardin.fr/robots.txt
Redirect Domain www.chaletdejardin.fr
Redirect Base chaletdejardin.fr
Domain IPs 104.21.14.204, 172.67.160.139, 2606:4700:3032::6815:ecc, 2606:4700:3037::ac43:a08b
Redirect IPs 104.21.14.204, 172.67.160.139, 2606:4700:3032::6815:ecc, 2606:4700:3037::ac43:a08b
Response IP 104.21.14.204
Found Yes
Hash 1a82deb4b6b7035b7afea36a66f67266dc7805132476546ba3edba6965b30ea6
SimHash ec0467528701

Groups

*

Rule Path
Disallow /admin/
Disallow /*?*sorting
Disallow /*?*toggle_filter=
Disallow /*?*criteria
Disallow /*?*id=
Disallow /*?*page=
Disallow /index.php/
Disallow */pdf/*
Disallow *.pdf
Disallow /reviews/
Disallow /cart/
Disallow /checkout/
Disallow /message-sent/
Disallow /leads/*
Disallow */assembly-info/pdf
Disallow */assembly-info/export

dotbot

Rule Path
Disallow /

smtbot

Rule Path
Disallow /

zoominfobot

Rule Path
Disallow /

aspiegelbot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

yak

Rule Path
Disallow /

mojeekbot

Rule Path
Disallow /

coccocbot-web

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

mail.ru_bot

Rule Path
Disallow /

adbeat_bot

Rule Path
Disallow /

tweetmemebot

Rule Path
Disallow /

surdotlybot

Rule Path
Disallow /

startmebot

Rule Path
Disallow /

hubspot crawler

Rule Path
Disallow /

houzzbot

Rule Path
Disallow /

coccocbot-image

Rule Path
Disallow /

heurekabot

Rule Path
Disallow /

seobilitybot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.chaletdejardin.fr/sitemap.xml

Comments

  • Do not crawl admin page
  • Do not crawl these pages
  • Unwelcome crawlers