dolce-gusto.co.kr
robots.txt

Robots Exclusion Standard data for dolce-gusto.co.kr

Resource Scan

Scan Details

Site Domain dolce-gusto.co.kr
Base Domain dolce-gusto.co.kr
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-10-04T17:11:05+00:00
Next Scan 2025-01-02T17:11:05+00:00

Last Successful Scan

Scanned2023-03-08T13:01:11+00:00
URL https://dolce-gusto.co.kr/robots.txt
Redirect https://www.dolce-gusto.co.kr/robots.txt
Redirect Domain www.dolce-gusto.co.kr
Redirect Base dolce-gusto.co.kr
Domain IPs 3.224.123.132, 35.171.238.11
Redirect IPs 125.56.219.18, 23.32.29.90
Response IP 125.56.219.18
Found Yes
Hash 386ee67e7d245dd127a1b2cb97629a4a525e504eeee769b3ad7d6c675b1fe504
SimHash 8516f953eff3

Groups

*

Product Comment
* Allow crawling over the following paths
Rule Path
Allow /js/
Allow /media/js/
Allow /media/css_secure/
Allow /skin/
Allow /*.js
Allow /*.css
Allow /skin/frontend/
Disallow /404/
Disallow /app/
Disallow /cgi-bin/
Disallow /downloader/
Disallow /errors/
Disallow /includes/
Disallow /lib/
Disallow /media/
Disallow */instantcart/*
Disallow */f/*
Disallow */id/*
Disallow */page_id/*
Allow /media/catalog/product/
Allow /media/catalog/category/
Allow /media/wysiwyg/
Disallow /pkginfo/
Disallow /report/
Disallow /scripts/
Disallow /shell/
Disallow /stats/
Disallow /var/
Disallow /premio/index*
Disallow /premio/index/index*
Disallow /index.php/
Disallow /catalog/product_compare/
Disallow */catalog/category*
Disallow */catalog/product*
Disallow /catalogsearch/
Disallow *flavour_cup_size%3D*
Disallow *coffee_intensity%3D*
Disallow *capsules_number%3D*
Disallow *flavour_caffeine2%3D*
Disallow *hot_or_cold%3D*
Disallow *price%3D*
Disallow *machine_capacity_water_tank%3D*
Disallow *machine_search_color%3D*
Disallow *machine_type%3D*
Disallow /control/
Disallow /contacts/
Disallow */customer/*
Disallow /customize/
Disallow /newsletter/
Disallow /poll/
Disallow /review/
Disallow *show%3Dreview-form*
Disallow /sendfriend/
Disallow /tag/
Disallow /wishlist/
Disallow /catalog/product/gallery/
Disallow /test-refund-offer
Disallow */checkout/*
Disallow /onestepcheckout/
Disallow */customer/*
Disallow /cron.php
Disallow /cron.sh
Disallow /error_log
Disallow /install.php
Disallow /LICENSE.html
Disallow /LICENSE.txt
Disallow /LICENSE_AFL.txt
Disallow /STATUS.txt
Disallow /*.php$
Disallow /*?SID=
Disallow /*?dir*
Disallow /*?dir=desc
Disallow /*?dir=asc
Disallow /*?limit=all
Disallow /*?mode*
Disallow /*?gclid=

googlebot-image

Rule Path
Allow /media/catalog/product/
Allow /media/catalog/category/
Allow /media/wysiwyg/
Disallow /m/instantcart/

Other Records

Field Value
sitemap https://www.dolce-gusto.co.kr/sitemap.xml

Comments

  • Crawlers Setup
  • Directories
  • we allow product /catalog and wyiwyg media images to be indexed for seo benefits
  • Paths (clean URLs)
  • Do not crawl checkout and user account pages
  • Files
  • Paths (no clean URLs)
  • Do not crawl sub category pages that are sorted or filtered.
  • Do not crawl add to cart for mobile