decathlon.it
robots.txt

Robots Exclusion Standard data for decathlon.it

Resource Scan

Scan Details

Site Domain decathlon.it
Base Domain decathlon.it
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-09-19T08:45:19+00:00
Next Scan 2024-11-18T08:45:19+00:00

Last Successful Scan

Scanned2024-07-15T04:24:35+00:00
URL https://decathlon.it/robots.txt
Redirect https://www.decathlon.it/robots.txt
Redirect Domain www.decathlon.it
Redirect Base decathlon.it
Domain IPs 104.18.20.198, 104.18.21.198
Redirect IPs 104.18.40.100, 172.64.147.156
Response IP 104.18.40.100
Found Yes
Hash 894270febf393953afd98b14b1180e27b6d09fc3af657c5e62496d90d4f62386
SimHash 273b175b67a3

Groups

*

Rule Path
Disallow /*product-review-post*
Disallow /p-r/
Disallow /*utility/vote/
Disallow /*notes%3D*
Disallow /*?offer=*
Disallow /*sort%3D*
Disallow /*direction%3D*
Disallow /*?Ndrc=*
Disallow /*?offer=*
Disallow /*redirectUrl%3D*
Disallow /*/_/0$
Disallow /*CustomProductCatalog*
Disallow /*reviewsNext
Disallow /*CartLink_updateCartSize
Disallow /*pageReviews
Disallow /*reviewsProductPage
Disallow /p//
Disallow */ajax/
Disallow */lib/hit
Disallow */stringify-jsurl
Disallow /*dynaTraceMonitor
Disallow /r/*page%3D*
Disallow /search?Ntt=*
Disallow /search?*
Disallow /search
Disallow /store-review-post*
Disallow /deals-page/*
Disallow /p-a/*
Disallow *reviews_page%3D*
Disallow *reviews_note%3D*
Allow /deals-page/_/N-1et8fub$
Allow /deals-page/_/N-19c5tde$
Disallow /p/carta-regalo/_/*intcmp%3Dintcmp%3Abanner-giftcard-*
Disallow /login
Disallow /our-suggestions/
Disallow /_Incapsula_Resource*
Disallow /help/app/ask_store/*
Disallow *opeco*
Disallow /*?No=*
Disallow /*Nrpp%3D*
Disallow /*size%3D*
Disallow /*from%3D*
Disallow /*?Ns=*
Disallow /*%26Ns%3D*
Disallow /*/f-*_*
Disallow /cdn-cgi/

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 2

facebookbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

Other Records

Field Value
sitemap https://www.decathlon.it/sitemap/index.xml
sitemap https://www.decathlon.it/sitemap-index.xml
sitemap https://www.decathlon.it/sitemap/category-product-listing-0.xml
sitemap https://www.decathlon.it/sitemap/product-0.xml
sitemap https://www.decathlon.it/sitemap/product-review-0.xml
sitemap https://www.decathlon.it/sitemap/brand-0.xml
sitemap https://www.decathlon.it/sitemap/advice-0.xml
sitemap https://www.decathlon.it/sitemap/landing-0.xml
sitemap https://www.decathlon.it/sitemap/sport-0.xml
sitemap https://www.decathlon.it/sitemap/mp-product-0.xml
sitemap https://www.decathlon.it/sitemap/mp-product-1.xml
sitemap https://www.decathlon.it/sitemap/sitemap-static-store-view-02.xml

Comments

  • ROBOTS.TXT CONFIG
  • REVIEWS
  • FITERED PAGES or PAGINATE
  • Disallow: /*~*/*~*/*~*
  • DUST
  • PERF MONITORING TOOL
  • Sitemaps
  • REVIEWS PAGINATION
  • INTERNAL SEARCH
  • STORE
  • Assistance pages
  • UPDATE ALTERNATE
  • ALLOWED DEALS PAGE
  • DISALLOWED BOUTIQUE GIFT CARD PAGES
  • DISALLOWED LOGIN PATH
  • Contatta la squadra del negozio - 20220125
  • DISALLOW OPECO - 20220523
  • DISALLOWED CATEGORY FILTERS - 20220523
  • Sorting - dec. 23
  • Block filters until activation
  • 2023-08-11 cldflr config
  • 2023-10-17 Majestic limit
  • 2024-03-29 FacebookBot limit

Warnings

  • 2 invalid lines.