media.conforama.fr
robots.txt

Robots Exclusion Standard data for media.conforama.fr

Resource Scan

Scan Details

Site Domain media.conforama.fr
Base Domain conforama.fr
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-09-16T23:52:21+00:00
Next Scan 2024-12-15T23:52:21+00:00

Last Successful Scan

Scanned 2023-01-31T15:32:44+00:00
URL https://media.conforama.fr/robots.txt
Redirect https://www.conforama.fr/robots.txt
Redirect Domain www.conforama.fr
Redirect Base conforama.fr
Domain IPs 18.155.153.114, 18.155.153.32, 18.155.153.57, 18.155.153.79, 2600:9000:214f:1200:14:6907:b780:93a1, 2600:9000:214f:200:14:6907:b780:93a1, 2600:9000:214f:3000:14:6907:b780:93a1, 2600:9000:214f:4600:14:6907:b780:93a1, 2600:9000:214f:6c00:14:6907:b780:93a1, 2600:9000:214f:9000:14:6907:b780:93a1, 2600:9000:214f:a400:14:6907:b780:93a1, 2600:9000:214f:aa00:14:6907:b780:93a1
Redirect IPs 104.18.40.223, 172.64.147.33, 2606:4700:4400::6812:28df, 2606:4700:4400::ac40:9321
Response IP 172.64.147.33
Found Yes
Hash c294c38d85218e6aae74529a2221c70218bcce4214fbec14bf2f50f7ae2172e9
SimHash 0d32df73c275

Groups

*

Rule Path
Disallow /

googlebot
googlebot-image
mediapartners-google
googlebot-news
googlebot-video
adsbot-google
adsbot-google-mobile
adsbot-google-mobile-apps
bingbot
adidxbot
duckduckbot
slurp
oncrawl
qwantify
pinterest
ahrefsbot
ahrefssiteaudit
barkrowler

Rule Path
Allow /
Disallow /jsessionid
Disallow /errorPage
Disallow /error/maintenance.html*
Disallow /*/dynaTraceMonitor
Disallow /*/js/decodeUrls.js*
Disallow /m/
Disallow /medias/
Disallow /*.pdf*
Disallow /*recherche-conforama*
Disallow /recherche-conforama/*
Disallow /*fromSearch*
Disallow /checkout/
Disallow /checkout
Disallow /compte/
Disallow /compte
Disallow /login/
Disallow /login
Disallow /identification*
Disallow /wishlist*
Disallow /authuser*
Disallow /special/
Disallow /*I-Page*
Disallow /*productavailable-info*
Disallow /*NW-11575-idstockmagasin*
Disallow /*showBVoiceAvis%3D*
Disallow /*fromBoutique%3D*
Disallow /*fromRetImmediat*
Disallow /*view%3Dlist*
Disallow /*view%3Dgrid*
Disallow /*show%3DStore*
Disallow /*view%3Ddefault*
Disallow /*query%3D*
Disallow /*typeListe*
Disallow /*indexProduct*
Disallow /*limit%3D*
Disallow /miniFiche
Disallow /store
Disallow /fr/
Disallow /404$
Disallow /404.html*
Disallow /*gap-promo*
Disallow /*carte-presto*
Disallow /*productCode%3D*
Disallow /*Incapsula_Resource*
Disallow /p/*/json*
Disallow /xn--/*
Disallow /webapp/wcs/stores/*
Disallow /webapp$
Disallow /virtPath/*
Disallow /univers-du-lit/*.html
Disallow /*item_group_id*
Disallow /*ved%3D*
Disallow /*sa%3D*
Disallow /*cstrackid%3D*
Disallow /contextual/
Disallow /confoUsers/
Disallow /cart/
Disallow /occ/
Disallow /cdn-cgi/
Disallow /*frz-smartcache*
Disallow /*frz-timeout*
Disallow /*frz-reason*
Disallow /*frz-flush*
Disallow /fstrz/r/stats-euwest1.fzcdn.net/beacon*
Disallow /*popin*
Disallow /*Popin*
Disallow /popin/
Disallow /*storePopin*
Disallow /*alertDispoPopin*
Disallow /*storeExpoPopin*
Disallow /*stockInOtherStoresPopin*
Disallow /*miraklOfferGradePopin*
Disallow /*storePopin*
Disallow /*fullCBPopin*
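
The rule table above mixes a blanket `Allow /` with many narrower `Disallow` patterns that use `*` and `$`. As a minimal sketch of how such rules are commonly evaluated (assuming the longest-match convention from RFC 9309, where the most specific pattern wins and `Allow` wins ties), the Python below checks a few paths against a small subset of the listed rules. The helper names `pattern_to_regex` and `is_allowed`, the `RULES` subset, and the sample paths are illustrative assumptions, not part of the scanned file, and individual crawlers may differ in detail.

```python
import re

# A small subset of the rules listed above, as (directive, pattern) pairs.
RULES = [
    ("Allow", "/"),
    ("Disallow", "/checkout"),
    ("Disallow", "/404$"),
    ("Disallow", "/*.pdf*"),
    ("Disallow", "/medias/"),
]

def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex anchored at the path start."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal text, then let each '*' match any run of characters.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_allowed(rules, path):
    """Return True if path may be crawled: longest matching pattern wins, Allow wins ties."""
    best = None  # (pattern length, is_allow)
    for directive, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), directive.lower() == "allow")
            if best is None or candidate > best:
                best = candidate
    # With no matching rule at all, crawling is permitted by default.
    return True if best is None else best[1]

print(is_allowed(RULES, "/canape.html"))          # matched only by "Allow /"
print(is_allowed(RULES, "/checkout/step1"))       # matched by "Disallow /checkout"
print(is_allowed(RULES, "/404"))                  # matched by "Disallow /404$"
print(is_allowed(RULES, "/docs/notice.pdf?x=1"))  # matched by "Disallow /*.pdf*"
```

Comparing `(length, is_allow)` tuples keeps the tie-break in one place: a longer pattern always outranks a shorter one, and at equal length `True` (Allow) sorts above `False` (Disallow).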

adsbot-google
adsbot-google-mobile
adsbot-google-mobile-apps
mediapartners-google
adidxbot

Rule Path
Allow /special/
Allow /*xtor*
Allow /*gclid*
Allow /*gclsrc*
Allow /*wiz*
Allow /*awc*
Allow /*utm*
Allow /*ved%3D*
Allow /*sa%3D*
Allow /*cstrackid%3D*

googlebot
googlebot-image
mediapartners-google
googlebot-news
googlebot-video
adsbot-google
adsbot-google-mobile
adsbot-google-mobile-apps
bingbot
adidxbot
duckduckbot
slurp
oncrawl
qwantify
pinterest
ahrefsbot
ahrefssiteaudit
barkrowler

Rule Path
Allow /*.jpg************
Allow /*.jpeg************
Allow /*.png************
Allow /*.gif************
Allow /*.webp************
Allow /*.svg************
Allow /*.tif************
Allow /*.mp4***********
Allow /m/webmaster/catalogues/
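
Several crawlers (for example `googlebot` and `bingbot`) appear in more than one group above. Under RFC 9309, groups naming the same user agent are combined, and the `*` group applies only when no named group matches. The sketch below illustrates that selection step on an abbreviated copy of the groups; the function name `rules_for`, the `GROUPS` structure, and the bot name `SomeOtherBot` are hypothetical, and exact-token matching is a simplification of how real crawlers compare user-agent names.

```python
# Abbreviated copy of the groups listed above: (user agents, rules).
GROUPS = [
    (["*"], [("Disallow", "/")]),
    (["googlebot", "bingbot"], [("Allow", "/"), ("Disallow", "/m/")]),
    (["adsbot-google"], [("Allow", "/special/")]),
    (["googlebot", "bingbot"], [("Allow", "/m/webmaster/catalogues/")]),
]

def rules_for(groups, user_agent):
    """Merge the rules of every group that names user_agent; fall back to '*'."""
    token = user_agent.lower()
    merged = []
    for agents, rules in groups:
        # User-agent tokens are compared case-insensitively.
        if token in (agent.lower() for agent in agents):
            merged.extend(rules)
    if merged:
        return merged
    # No named group matched, so the wildcard group applies.
    for agents, rules in groups:
        if "*" in agents:
            merged.extend(rules)
    return merged

print(rules_for(GROUPS, "Googlebot"))
print(rules_for(GROUPS, "SomeOtherBot"))
```

For `Googlebot`, the merged list contains both `Disallow /m/` and the longer `Allow /m/webmaster/catalogues/`, which is why the catalogue directory can stay crawlable even though `/m/` is otherwise blocked.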

Other Records

Field Value
sitemap https://www.conforama.fr/sitemap-index.xml

Comments

  • [ASCII-art banner reading "Conforama"]
  • Le confort pour crawl ("Comfort for crawling") - v26.20221003
  • Block access to all UA by default
  • Allow access to specific UA
  • Block specific technical
  • Block medias
  • Block search
  • Block account pages
  • Block spider trap
  • Disallow: /*LGWCODE*
  • Block fasterize parameters
  • Block popins
  • Allow ads bot on specials
  • Allow images and special medias
  • Allow search crawlers to discover the sitemap