houra.fr
robots.txt

Robots Exclusion Standard data for houra.fr

Resource Scan

Scan Details

Site Domain houra.fr
Base Domain houra.fr
Scan Status Ok
Last Scan 2024-09-11T22:21:18+00:00
Next Scan 2024-10-11T22:21:18+00:00

Last Scan

Scanned 2024-09-11T22:21:18+00:00
URL https://www.houra.fr/robots.txt
Domain IPs 195.154.219.240
Response IP 195.154.219.240
Found Yes
Hash 0ba2dd8ddc6ee2005b059573e4d8bd58f2a9b426f99de16b9a9a7f0022369e0e
SimHash d51fe2965b56
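
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. As a minimal sketch under that assumption, a content fingerprint of this kind could be computed as follows. The function name `robots_hash` and the sample body are hypothetical, and the result is not expected to reproduce the value listed above.

```python
import hashlib


def robots_hash(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the scanner hashes the raw response bytes with SHA-256.
    The report itself does not state which algorithm it uses.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, not the actual file served by houra.fr.
sample_body = b"User-agent: *\nDisallow: /pay/\n"
digest = robots_hash(sample_body)
print(len(digest))  # a SHA-256 hex digest is always 64 characters long
```

Comparing the digest from one scan with the digest from the next is a cheap way to detect that the file changed at all, without diffing its content.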

Groups

*

Rule Path
Disallow /classe/
Disallow /classes/
Disallow /com/
Disallow /conf/
Disallow /inc/
Disallow /jeux/
Disallow /mail/
Disallow /memolistes/
Disallow /newsletter/
Disallow /pan/
Disallow /pay/
Disallow /picking/
Disallow /survey/
Disallow /template/
Disallow /recherche/
Disallow /ws/
Disallow /ws2/
Disallow /ws_mobile/
Disallow /ws_sms/
Disallow /layer/layer_mail_mr.php?*
Disallow /lp
Disallow /cpt/

dotbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

sogou

Rule Path
Disallow /

yandex

Rule Path
Disallow /

shopwiki

Rule Path
Disallow /

python-urllib

Rule Path
Disallow /

python-urllib/2.5

Rule Path
Disallow /

netsparker

Rule Path
Disallow /

net veille

Rule Path
Disallow /

paarly

Rule Path
Disallow /

spycommerce

Rule Path
Disallow /

pricing assistant

Rule Path
Disallow /

price observatory

Rule Path
Disallow /

workit

Rule Path
Disallow /

keox

Rule Path
Disallow /

pricejet

Rule Path
Disallow /

prixan

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

geedobot

Rule Path
Disallow /

petalbot

Rule Path
Disallow /

awariorssbot

Rule Path
Disallow /

awariosmartbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

linguee

Rule Path
Disallow /

mtrobot

Rule Path
Disallow /

linkpadbot

Rule Path
Disallow /

twitterbot

Rule Path
Disallow /

olbico

Rule Path
Disallow /

serpstatbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /
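
To illustrate how the groups above are applied, here is a minimal sketch using Python's standard `urllib.robotparser`. The robots.txt text is a short excerpt rebuilt from the rules listed above, not the full file, and the agent name `examplebot` is hypothetical. The wildcard rule `/layer/layer_mail_mr.php?*` is left out because this parser only does plain prefix matching.

```python
from urllib.robotparser import RobotFileParser

# Excerpt rebuilt from the groups listed in this report (not the full file).
ROBOTS_EXCERPT = """\
User-agent: *
Disallow: /recherche/
Disallow: /pay/
Disallow: /lp

User-agent: scrapy
Disallow: /

User-agent: petalbot
Disallow: /
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text from a string, without any network access."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_EXCERPT)
base = "https://www.houra.fr"

# "examplebot" has no group of its own, so the "*" group applies to it.
print(parser.can_fetch("examplebot", base + "/"))                # True
print(parser.can_fetch("examplebot", base + "/recherche/lait"))  # False
print(parser.can_fetch("examplebot", base + "/lp-promo"))        # False ("/lp" is a prefix)

# Agents with a dedicated "Disallow /" group are blocked everywhere.
print(parser.can_fetch("scrapy", base + "/"))                    # False
```

An agent that matches a named group follows only that group, so the path rules under `*` matter only for crawlers that are not listed individually.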

Comments

  • Blocking certain unneeded bots

Warnings

  • `noindex` is not a known field.
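
A warning like the one above can be reproduced with a simple field check. The following is a rough sketch only: the set `KNOWN_FIELDS` is an assumption about which field names a scanner might accept, and the sample text is hypothetical, since the report does not show the actual `noindex` line from the file.

```python
# Assumed set of recognised field names; the scanner's real list is not documented here.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay"}


def unknown_fields(text: str) -> list[str]:
    """Return field names that are not in KNOWN_FIELDS, in order of first appearance."""
    found: list[str] = []
    for raw in text.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and surrounding whitespace
        if not line or ":" not in line:
            continue
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in found:
            found.append(field)
    return found


# Hypothetical sample, not the actual content of houra.fr's robots.txt.
sample = """\
# Blocage de certains bots inutiles
User-agent: *
Disallow: /pay/
Noindex: /lp
"""

for name in unknown_fields(sample):
    print(f"`{name}` is not a known field.")
```

Crawlers generally ignore fields they do not recognise, so a line of this kind should not be relied on to keep pages out of an index.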