lamaison.fr
robots.txt

Robots Exclusion Standard data for lamaison.fr

Archived Snapshots

Resource Scan

Scan Details

Site Domain	lamaison.fr
Base Domain	lamaison.fr
Scan Status	Ok
Last Scan	2024-11-04T03:45:26+00:00
Next Scan	2024-12-04T03:45:26+00:00

Last Scan

Scanned	2024-11-04T03:45:26+00:00
URL	https://lamaison.fr/robots.txt
Redirect	https://www.lamaison.fr/robots.txt
Redirect Domain	www.lamaison.fr
Redirect Base	lamaison.fr
Domain IPs	151.101.1.124, 151.101.121.124, 151.101.129.124, 151.101.193.124, 151.101.65.124
Redirect IPs	151.101.1.124, 151.101.129.124, 151.101.193.124, 151.101.65.124
Response IP	199.232.45.124
Found	Yes
Hash	b689eebc86a5749734e9250930c3d01db6d7b3f56d7f5d4591332edd54aa3781
SimHash	c332f3684b9f

Groups

alphaseobot
alphaseobot-sa
baiduspider-favo
baiduspider-cpro
baiduspider-ads
baidu
baiduspider-news
baiduspider-video
baiduspider-image
baiduspider
blexbot
alexibot
alvinetspider
antenne hatena
apocalxexplorerbot
asterias
backdoorbot/1.0
bizinformation
black hole
blowfish/1.0
botalot
builtbottough
bullseye/1.0
bunnyslippers
cegbfeieh
cheesebot
cherrypicker
cherrypickerelite/1.0
cherrypickerse/1.0
copyrightcheck
cosmos
crescent
crescent internet toolpak http ole control v.1.0
disco pump 3.1
dittospyder
dotbot
emailcollector
emailsiphon
emailwolf
erocrawler
exabot
extractorpro
flamingo_searchengine
foobot
grapeshot
harvest/1.5
hloader
httplib
httrack
httrack 3.0
humanlinks
igentia
infonavirobot
jennybot
jikespider
kenjin spider
lexibot
libweb/clshttp
linkextractorpro
linkscan/8.1a unix
linkwalker
lwp-trivial
lwp-trivial/1.34
mata hari
microsoft url control - 5.01.4511
microsoft url control - 6.00.8169
miixpc
miixpc/4.2
mister pix
mlbot
moget
moget/2.1
ms search 4.0 robot
ms search 5.0 robot
naverbot
netants
netattache
netmechanic
nicerspro
offline explorer
openfind
openindexspider
propowerbot/2.14
prowebwalker
psbot
quepasacreep
queryn metasearch
repomonkey
rma
sightupbot
sitebot
sitesnagger
sitesucker
sogou web spider
sosospider
spankbot
spanner
speedy
suggybot
superbot
superbot/2.6
suzuran
szukacz/1.4
teleport
telesoft
the intraformant
thenomad
tighttwatbot
titan
tocrawl/urldispatcher
toscrawler
trendictionbot
true_robot
true_robot/1.0
turingos
turnitinbot
urlpouls
urly warning
vci
web image collector
webauto
webbandit
webbandit/3.50
webcopier
webcopy
webenhancer
webmasterworldforumbot
webmirror
webreaper
websauger
website extractor
website quester
webster pro
webstripper
webstripper/2.02
webzip
wget
wikiofeedbot
winhttrack
www-collector-e
xenu link sleuth/1.3.8
yacy
yandex
yrspider
zeus
zookabot

Rule	Path
Disallow	/

Rule

Path

Disallow

/

*

Rule	Path
Disallow	/app/
Disallow	/bin/
Disallow	/dev/
Disallow	/lib/
Disallow	/phpserver/
Disallow	/pkginfo/
Disallow	/report/
Disallow	/setup/
Disallow	/update/
Disallow	/var/
Disallow	/vendor/
Disallow	/index.php/
Disallow	/catalog/product_compare/
Disallow	/catalog/category/view/
Disallow	/catalog/product/view/
Disallow	/catalogsearch/
Disallow	/checkout/
Disallow	/control/
Disallow	/contacts/
Disallow	/customer/
Disallow	/customize/
Disallow	/newsletter/
Disallow	/review/
Disallow	/sendfriend/
Disallow	/wishlist/
Disallow	/mooauth/
Disallow	/composer.json
Disallow	/composer.lock
Disallow	/CONTRIBUTING.md
Disallow	/CONTRIBUTOR_LICENSE_AGREEMENT.html
Disallow	/COPYING.txt
Disallow	/Gruntfile.js
Disallow	/LICENSE.txt
Disallow	/LICENSE_AFL.txt
Disallow	/nginx.conf.sample
Disallow	/package.json
Disallow	/php.ini.sample
Disallow	/RELEASE_NOTES.txt
Disallow	/?product_list_mode=
Disallow	/?product_list_order=
Disallow	/?product_list_limit=
Disallow	/?product_list_dir=
Disallow	/*?category_es=
Disallow	/*?SID=

Rule

Path

Disallow

/app/

Disallow

/bin/

Disallow

/dev/

Disallow

/lib/

Disallow

/phpserver/

Disallow

/pkginfo/

Disallow

/report/

Disallow

/setup/

Disallow

/update/

Disallow

/var/

Disallow

/vendor/

Disallow

/index.php/

Disallow

/catalog/product_compare/

Disallow

/catalog/category/view/

Disallow

/catalog/product/view/

Disallow

/catalogsearch/

Disallow

/checkout/

Disallow

/control/

Disallow

/contacts/

Disallow

/customer/

Disallow

/customize/

Disallow

/newsletter/

Disallow

/review/

Disallow

/sendfriend/

Disallow

/wishlist/

Disallow

/mooauth/

Disallow

/composer.json

Disallow

/composer.lock

Disallow

/CONTRIBUTING.md

Disallow

/CONTRIBUTOR_LICENSE_AGREEMENT.html

Disallow

/COPYING.txt

Disallow

/Gruntfile.js

Disallow

/LICENSE.txt

Disallow

/LICENSE_AFL.txt

Disallow

/nginx.conf.sample

Disallow

/package.json

Disallow

/php.ini.sample

Disallow

/RELEASE_NOTES.txt

Disallow

/*?*product_list_mode=

Disallow

/*?*product_list_order=

Disallow

/*?*product_list_limit=

Disallow

/*?*product_list_dir=

Disallow

/*?category_es=

Disallow

/*?SID=

Back to top

Other Records

Field	Value
sitemap	http://www.lamaison.fr/media/sitemap_lamaison.xml

Field

Value

sitemap

http://www.lamaison.fr/media/sitemap_lamaison.xml

Back to top

Comments

Liste des robots exclus pour préserver les performances (commenter la ligne pour ne plus exclure un robot donné)
règles robots.txt pour tous les autres robots-crawlers (autorisés)
Paramètre Crawl-delay : nombre de secondes à attendre entre des requêtes successives.
Indiquer ce paramètre si vous rencontrez des problèmes de charge sur le serveur, notamment lors de passage de robot.
Crawl-delay: 30
Ne pas crawler ces répertoires :
Ne pas crawler les URLs avec ces segments de chemin :
Ne pas crawler ces fichiers :
Ne pas crawler les pages Product List incluant des paramètres de tri ou de filtre
Spécifique Elastic Search - Ne pas crawler les URL de navigation à facettes :
Ne pas crawler les URLs paramètre d’identification de session :
Chemin du ou des sitemap

Back to top

Warnings

1 invalid line.

Back to top

lamaison.frrobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

Other Records

Comments

Warnings

lamaison.fr
robots.txt