ghirardelli.com
robots.txt

Robots Exclusion Standard data for ghirardelli.com

Resource Scan

Scan Details

Site Domain ghirardelli.com
Base Domain ghirardelli.com
Scan Status Ok
Last Scan2025-11-17T13:23:20+00:00
Next Scan 2025-11-24T13:23:20+00:00

Last Scan

Scanned2025-11-17T13:23:20+00:00
URL https://www.ghirardelli.com/robots.txt
Domain IPs 151.101.1.124, 151.101.129.124, 151.101.193.124, 151.101.65.124
Response IP 146.75.45.124
Found Yes
Hash 907ccf63de5803ca4c45a18f1d2139e18868fb7a48e261cc7dd74c09156646ff
SimHash 690fdf4bc2f1

Groups

*

Rule Path
Disallow /index.php/
Disallow /checkout/
Disallow /app/
Disallow /lib/
Disallow /*.php$
Disallow /pkginfo/
Disallow /report/
Disallow /var/
Disallow /customer/
Disallow /sendfriend/
Disallow /review/
Disallow /*SID%3D
Disallow *%26price*
Disallow *%26format*
Disallow *%26brand*
Disallow *%26type_of_chocolate*
Disallow *%26flavours*
Disallow *%26gift_ideas*
Disallow *%26dietary_requirements*
Disallow *%26all_diets*
Disallow *%26chocolate_type*
Disallow *%26difficulty*
Disallow *%26seasons*
Disallow *%26categories*
Disallow /account/create/*
Disallow /customer/account/*
Disallow /checkout/*
Disallow /catalogsearch/*

seekportbot

Rule Path
Disallow /

imagesiftbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

brightedge

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

oai-searchbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

Other Records

Field Value
sitemap https://www.ghirardelli.com/media/sitemap/sitemap_gcc.xml

Comments

  • Block Stacking Params (You will need to change these to match your filter options)
  • Block Catalog, Search and Accounts
  • Sitemap files
  • Block SeekportBot
  • Crawler Delay for ImagesiftBot
  • Crawler Delay for BrightEdge
  • Crawler Delay for OAI-SearchBot