cappellini.com
robots.txt

Robots Exclusion Standard data for cappellini.com

Resource Scan

Scan Details

Site Domain cappellini.com
Base Domain cappellini.com
Scan Status Ok
Last Scan 2025-11-08T05:42:12+00:00
Next Scan 2025-12-08T05:42:12+00:00

Last Scan

Scanned 2025-11-08T05:42:12+00:00
URL https://cappellini.com/robots.txt
Redirect https://www.cappellini.com/robots.txt
Redirect Domain www.cappellini.com
Redirect Base cappellini.com
Domain IPs 104.18.18.205, 104.18.19.205, 2606:4700::6812:12cd, 2606:4700::6812:13cd
Redirect IPs 104.18.18.205, 104.18.19.205, 2606:4700::6812:12cd, 2606:4700::6812:13cd
Response IP 104.18.18.205
Found Yes
Hash f5c09fb6fef08794b63ea2877469bdbd3210ce36e51be61535406e7be8213d13
SimHash 699253ddc670

Groups

*
oai-searchbot

Rule Path
Allow /
Disallow /cdn-cgi/
Disallow */private-area/*
Disallow */added-to-cart.html
Disallow */shopping-cart.html
Disallow */checkout*
Disallow */_jcr_content/*
Disallow /content/experience-fragments/cappellini/
Disallow /content/experience-fragments/cappellini-catalog/
Disallow /content/dam/ld/cappellini/products/preloader/*
Disallow /content/dam/ld/cappellini/contacts/*
Disallow /content/dam/ld/cappellini/template-mail/*
Disallow /content/dam/ld/cappellini/products/*/20_area_professionals/*
Disallow /content/dam/ld/cappellini/private-area/*

Other Records

Field Value Comment
crawl-delay 10 10 seconds between page requests
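
The wildcard rules in this group can be checked against a path with a small matcher. What follows is a minimal sketch, not the scanner's own logic: it assumes longest-match precedence with Allow winning ties (the convention in RFC 9309), and it assumes a parser that accepts patterns beginning with `*` rather than `/`. The names RULES, pattern_to_regex and is_allowed are hypothetical, and only a subset of the rules listed above is included.

```python
import re

# Subset of the rules listed for the "*" / "oai-searchbot" group above.
RULES = [
    ("allow", "/"),
    ("disallow", "/cdn-cgi/"),
    ("disallow", "*/private-area/*"),
    ("disallow", "*/checkout*"),
    ("disallow", "*/_jcr_content/*"),
    ("disallow", "/content/experience-fragments/cappellini/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal segments and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be fetched under `rules`.

    Assumption: the longest matching pattern wins, and Allow beats
    Disallow when two matching patterns have the same length.
    """
    best = None  # (pattern length, is_allow)
    for kind, pattern in rules:
        # re.match anchors at the start of the path only.
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # No matching rule means the path is allowed by default.
    return True if best is None else best[1]
```

Under these assumptions, is_allowed("/us/en/checkout/step1") is False because `*/checkout*` is a longer match than `/`, while an ordinary page path that matches only `/` stays allowed.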

facebookexternalhit/1.1 (+http://www.facebook.com/externalhit_uatext.php)

Rule Path
Disallow /csrf

Other Records

Field Value
crawl-delay 10

adidxbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

pinterestbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

adsbot-google

Rule Path
Allow /*?utm_source=*

Other Records

Field Value
crawl-delay 5
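
The crawl-delay values reported for the groups above can be collected into a lookup that a polite fetcher consults before each request. This is a rough sketch under stated assumptions: crawl-delay is not part of RFC 9309 and crawlers interpret it differently, the substring matching of user-agent tokens is a simplification, and the names CRAWL_DELAY and crawl_delay_for are hypothetical.

```python
# Crawl-delay values in seconds, as reported per group above,
# keyed by lowercase user-agent token.
CRAWL_DELAY = {
    "*": 10,
    "oai-searchbot": 10,
    "facebookexternalhit": 10,
    "adidxbot": 5,
    "bingbot": 5,
    "pinterestbot": 5,
    "adsbot-google": 5,
}


def crawl_delay_for(user_agent):
    """Return the crawl-delay (seconds) that applies to `user_agent`.

    Assumption: a group applies when its token appears anywhere in the
    lowercased User-Agent string; the longest such token is preferred,
    and the "*" group is the fallback.
    """
    ua = user_agent.lower()
    matches = [token for token in CRAWL_DELAY if token != "*" and token in ua]
    if matches:
        return CRAWL_DELAY[max(matches, key=len)]
    return CRAWL_DELAY["*"]
```

A caller would typically sleep for crawl_delay_for(ua) seconds between page requests; a user agent matching none of the named groups falls back to the 10 seconds of the `*` group.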

Other Records

Field Value
sitemap https://www.cappellini.com/it/it/sitemap.xml
sitemap https://www.cappellini.com/at/en/sitemap.xml
sitemap https://www.cappellini.com/be/en/sitemap.xml
sitemap https://www.cappellini.com/dk/en/sitemap.xml
sitemap https://www.cappellini.com/fi/en/sitemap.xml
sitemap https://www.cappellini.com/fr/en/sitemap.xml
sitemap https://www.cappellini.com/de/en/sitemap.xml
sitemap https://www.cappellini.com/lu/en/sitemap.xml
sitemap https://www.cappellini.com/nl/en/sitemap.xml
sitemap https://www.cappellini.com/no/en/sitemap.xml
sitemap https://www.cappellini.com/pt/en/sitemap.xml
sitemap https://www.cappellini.com/ww/en/sitemap.xml
sitemap https://www.cappellini.com/es/en/sitemap.xml
sitemap https://www.cappellini.com/se/en/sitemap.xml
sitemap https://www.cappellini.com/ch/en/sitemap.xml
sitemap https://www.cappellini.com/gb/en/sitemap.xml
sitemap https://www.cappellini.com/us/en/sitemap.xml

Comments

  • BOTs Directives
  • Block access to specific cloudflare endpoint
  • Block access to specific groups of pages
  • Block access to specific folders of assets
  • Facebook crawler too many requests
  • Allow Bingbot(s) to crawl faster
  • Sitemaps

Warnings

  • `visit-time` is not a known field.