cf-t.com
robots.txt

Robots Exclusion Standard data for cf-t.com

Resource Scan

Scan Details

Site Domain cf-t.com
Base Domain cf-t.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2026-02-09T06:16:40+00:00
Next Scan 2026-05-10T06:16:40+00:00

Last Successful Scan

Scanned 2025-06-20T11:00:44+00:00
URL https://cf-t.com/robots.txt
Domain IPs 104.26.14.166, 104.26.15.166, 172.67.72.33, 2606:4700:20::681a:ea6, 2606:4700:20::681a:fa6, 2606:4700:20::ac43:4821
Response IP 172.67.72.33
Found Yes
Hash 85e4fad0357494377c65447dc18ac5390b314be8467da599d1675b6e55ef4e37
SimHash 2f105123f331

Groups

googlebot-image

Rule Path
Disallow (empty value; all paths allowed)

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

yandex

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10
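
As a rough illustration of how a crawler might read the crawl-delay records above, the sketch below feeds a reconstructed fragment to Python's standard `urllib.robotparser`. The `ASSUMED_LINES` list is rebuilt from the scan records and is an assumption, not the raw file; `delay_for` and its `default` parameter are hypothetical helper names.

```python
from urllib.robotparser import RobotFileParser

# ASSUMPTION: reconstructed from the scan records, not copied from the raw file.
ASSUMED_LINES = [
    "User-agent: googlebot-image",
    "Disallow:",
    "",
    "User-agent: *",
    "Crawl-delay: 10",
    "",
    "User-agent: yandex",
    "Crawl-delay: 10",
    "",
    "User-agent: msnbot",
    "Crawl-delay: 10",
    "",
    "User-agent: slurp",
    "Crawl-delay: 10",
]

parser = RobotFileParser()
parser.parse(ASSUMED_LINES)


def delay_for(agent: str, default: float = 0.0) -> float:
    """Return the crawl delay in seconds for an agent (hypothetical helper)."""
    delay = parser.crawl_delay(agent)
    # crawl_delay() returns None when the matching group sets no delay.
    return float(delay) if delay is not None else default
```

A polite client would sleep for `delay_for(agent)` seconds between requests. Not every crawler honors crawl-delay, so the value is best treated as a request rather than a guarantee.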

*

Rule Path
Disallow /index.php
Disallow /newsletter/
Disallow /catalog/
Disallow /sendfriend/
Disallow /control/
Disallow /contacts/
Disallow /customize/
Disallow /poll/
Disallow /review/
Disallow */shop*
Allow */shop-*
Disallow /rest/ca_en/V1/rewards/mine/guest-highlight/product/
Disallow /catalogsearch/
Disallow /catalog/product_compare/
Disallow /catalog/category/view/
Disallow /catalog/product/view/
Disallow /catalog/product/gallery/
Disallow /tag/
Disallow /checkout/
Disallow /customer/
Disallow /wishlist/
Disallow /*.php$
Disallow /*?*SID=
Disallow /*?*product_list_dir=
Disallow /*?*product_list_order=
Disallow /*%26*%3D*%26*%3D*%26*%3D*
Disallow /*?lesite_brand*&lesite_brand*
Disallow /*?*&lesite_brand*&lesite_brand*

Other Records

Field Value
sitemap https://cf-t.com/sitemap.xml

Comments

  • ICG robots.txt, v2024-10
  • Google Image Crawler Setup
  • All Crawlers Setup
  • Crawlers Setup
  • Paths (clean URLs)
  • Restrict Catalog Pages
  • Restrict Checkout and Customer account
  • Do not index session ID
  • Do not index pages that are sorted or filtered
  • Do not crawl multiple brand attributes
  • Sitemaps: