bois-direct-usine.com
robots.txt

Robots Exclusion Standard data for bois-direct-usine.com

Resource Scan

Scan Details

Site Domain bois-direct-usine.com
Base Domain bois-direct-usine.com
Scan Status Ok
Last Scan 2024-09-14T04:25:26+00:00
Next Scan 2024-10-14T04:25:26+00:00

Last Scan

Scanned 2024-09-14T04:25:26+00:00
URL https://bois-direct-usine.com/robots.txt
Redirect https://www.bois-direct-usine.com/robots.txt
Redirect Domain www.bois-direct-usine.com
Redirect Base bois-direct-usine.com
Domain IPs 54.36.220.60
Redirect IPs 54.36.220.60
Response IP 54.36.220.60
Found Yes
Hash 1ac067e5fb803c49eb659d8edff806e22555dca43465fa931243b62ebd30ba43
SimHash 9896cf363693
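
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. As a minimal sketch under that assumption, a content fingerprint of a fetched robots.txt body could be computed as follows. The function name `content_fingerprint` and the sample bytes are hypothetical, and the result is not expected to reproduce the hash shown in this report.

```python
import hashlib


def content_fingerprint(body: bytes) -> str:
    """Return a hex digest of the raw robots.txt bytes.

    Assumption: SHA-256 is used; the scan report does not state this.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample body, not the real file served by the site.
sample = b"User-agent: *\nDisallow: /page-introuvable/\n"
digest = content_fingerprint(sample)
print(len(digest))  # a SHA-256 hex digest always has 64 characters
```

Comparing the digest from one scan with the digest from the next is a cheap way to detect that the file changed at all; it says nothing about how much it changed.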

Groups

*

Rule Path
Disallow /*account.html
Disallow /rechercher*
Disallow /mon-compte/*
Disallow /mon-panier/*
Disallow /*login*
Disallow /*addacomment*
Disallow /*subscribe.html
Disallow /*clickAd-*
Disallow /*events_search_init*
Disallow /*update_box_cart*
Disallow /*viewcart*
Disallow /*update_product_price*
Disallow /*clickAd-*
Disallow /*login*
Disallow /*logout*
Disallow /*password.html
Disallow /*imagezoom*
Disallow /content/*
Disallow /newsletter/*
Disallow /page-introuvable/
Disallow /*feedcomments*
Disallow /*emajineBenchmark*
Disallow /*addacomment*
Disallow /*sort_value*
Disallow /*sort_way*
Disallow /*fillform*
Disallow /*filtre%3D*
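
Most of the rules in this group rely on the `*` wildcard. The sketch below shows one way such paths can be tested against a URL path, assuming the common interpretation in which `*` matches any run of characters, a trailing `$` anchors the end, and every rule is otherwise a prefix match. The helper names `rule_to_regex` and `is_disallowed` and the sample URL paths are hypothetical; Allow rules, longest-match precedence, and percent-encoding normalization are left out.

```python
import re


def rule_to_regex(rule_path: str) -> re.Pattern:
    """Translate a robots.txt rule path into a compiled regex."""
    anchored = rule_path.endswith("$")
    if anchored:
        rule_path = rule_path[:-1]
    # Escape the literal pieces and let each '*' match any characters.
    body = ".*".join(re.escape(part) for part in rule_path.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(url_path: str, rules: list[str]) -> bool:
    """True if any Disallow rule matches from the start of the path."""
    return any(rule_to_regex(rule).match(url_path) for rule in rules)


# A few rules copied from the '*' group above.
RULES = [
    "/*account.html",
    "/rechercher*",
    "/mon-compte/*",
    "/*login*",
    "/page-introuvable/",
]

print(is_disallowed("/fr/account.html", RULES))           # True
print(is_disallowed("/rechercher?q=chene", RULES))        # True
print(is_disallowed("/terrasse-bois/lames.html", RULES))  # False
```

Because rules are prefix matches, a trailing `*` as in `/rechercher*` does not change what is matched under this interpretation.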

ahrefsbot
baiduspider
blexbot
barkrowler
catchbot
cfnetwork
cliqzbot
digincore
domaincrawler
domaincrawler/3.0
dotbot
duppies
exaleadcloudview
faviconizer
garlik
garlikcrawler
grapeshot
iccrawler - icjobs
linkdexbot
linkdexbot/2.0
linkdexbot/2.2
mail.ru
megaindex
melodibot
mj12bot
obot
panscient.com
proximic
python-urllib
r6_feedfetcher
r6_commentreader
root/0.1
semrushbot
semrushbot-sa
seznambot
sistrix
smtbot
spbot
spinn3r
titan
turnitinbot
turnitinbot
twengabot-2.0
ucrawler/1.0
ucrawler
uptimebot
wotbox
yandex
yisouspider
zoombot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /
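
The report lists three groups: the `*` group, a group shared by many named crawlers, and a separate `mj12bot` group. A crawler is expected to follow the group that names it and to fall back to `*` otherwise. The sketch below illustrates that selection with a simplified case-insensitive substring test. The function `select_group`, the reduced `GROUPS` mapping, and the sample User-Agent strings are hypothetical, and real crawlers may match product tokens more strictly.

```python
def select_group(user_agent: str, groups: dict[str, list[str]]) -> str:
    """Pick the group token that applies to a User-Agent string.

    Simplification: a token applies if it occurs anywhere in the
    lowercased User-Agent; the longest such token wins.
    """
    ua = user_agent.lower()
    matches = [token for token in groups if token != "*" and token in ua]
    if matches:
        return max(matches, key=len)
    return "*"


# Reduced mapping: a handful of tokens taken from the groups above.
GROUPS = {
    "*": ["/page-introuvable/", "/content/*"],
    "ahrefsbot": ["/"],
    "semrushbot": ["/"],
    "semrushbot-sa": ["/"],
    "mj12bot": ["/"],
}

print(select_group("Mozilla/5.0 (compatible; SemrushBot-SA/0.97)", GROUPS))
# semrushbot-sa
print(select_group("Mozilla/5.0 (compatible; Googlebot/2.1)", GROUPS))
# *
```

A crawler that lands in one of the named groups is subject to `Disallow /`, which blocks the whole site for it.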

Other Records

Field Value
sitemap https://www.bois-direct-usine.com/sitemap-1-google.xml
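
As a rough illustration of how the records in this report could be read back programmatically, the sketch below feeds a small excerpt to Python's standard `urllib.robotparser`. The excerpt is reconstructed from the values listed here; the layout of the real file is an assumption. The standard-library parser does plain prefix matching, so the wildcard rules from the `*` group are deliberately left out.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed excerpt (assumed layout), not the full robots.txt.
ROBOTS_EXCERPT = """\
User-agent: *
Disallow: /page-introuvable/

User-agent: mj12bot
Disallow: /

Sitemap: https://www.bois-direct-usine.com/sitemap-1-google.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_EXCERPT.splitlines())

BASE = "https://www.bois-direct-usine.com"

# Sitemap URLs declared in the file (site_maps() needs Python 3.8+).
print(parser.site_maps())

# mj12bot has its own group with 'Disallow: /'.
print(parser.can_fetch("mj12bot", BASE + "/"))

# Other crawlers fall back to the '*' group.
print(parser.can_fetch("Googlebot", BASE + "/page-introuvable/"))
print(parser.can_fetch("Googlebot", BASE + "/terrasse.html"))
```

The path `/terrasse.html` is a made-up example used only to show an allowed URL.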

Comments

  • Disallow: /*addtocart*
  • Disallow: /*getpagecontent*
  • forbidden robots