
Robots Exclusion Standard data for thejewellers.com

Resource Scan

Scan Details

Site Domain thejewellers.com
Base Domain thejewellers.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-08-20T02:36:26+00:00
Next Scan 2025-11-18T02:36:26+00:00
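
The gap between the last and next scan timestamps can be checked with a few lines of Python. This is only an arithmetic sketch on the two values shown above; the variable names are illustrative, and reading the result as a fixed rescan interval is an assumption, not something the report states.

```python
from datetime import datetime

# Timestamps copied from the Scan Details above (ISO 8601, UTC offset included).
last_scan = datetime.fromisoformat("2025-08-20T02:36:26+00:00")
next_scan = datetime.fromisoformat("2025-11-18T02:36:26+00:00")

# Subtracting two aware datetimes yields a timedelta.
interval = next_scan - last_scan
print(interval.days)  # 90
```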

Last Successful Scan

Scanned 2025-03-31T00:44:29+00:00
URL https://thejewellers.com/robots.txt
Domain IPs 104.18.26.251, 104.18.27.251, 2606:4700::6812:1afb, 2606:4700::6812:1bfb
Response IP 104.18.27.251
Found Yes
Hash 092e2cf02226b2b6e088a7e1df006f1c652c7d8a3745d4fbef7cd6d805b83276
SimHash ad187111fba0

Groups

urlspy

Rule Path
Disallow /

*
mozilla/5.0 (compatible;nostocrawlerbot/1.0;+http://my.nosto.com/tagging)
facebookcatalog/1.0

Rule Path
Allow /
Disallow /blackhole/
Disallow /sand/
Allow /catalog/seo_sitemap/category/
Allow /catalog/seo_sitemap/product/
Disallow /catalogsearch/result/
Disallow /wishlist/
Disallow /gwishlist/
Allow /media/catalog/product
Disallow /404/
Disallow /api/
Disallow /install/
Disallow /index.php/
Disallow /catalog/product/view/
Disallow /catalog/category/view/
Disallow /catalog/product_compare/
Disallow /catalogsearch/
Disallow /catalogsearch/advanced/
Disallow /catalogsearch/advanced/result/
Disallow /catalogsearch/term/
Disallow /catalogsearch/term/popular/
Disallow /checkout/
Disallow /checkout/cart/
Disallow /checkout/cart/add/
Disallow /control/
Disallow /contacts/index/
Disallow /contacts/index/post/
Disallow /customer/
Disallow /customize/
Disallow /newsletter/
Disallow /poll/
Disallow /review/
Disallow /sendfriend/
Disallow /tag/
Allow /*?p=
Disallow /*?p=*&
Disallow /*.php$
Disallow /*dir%3D*$
Disallow /*limit%3D*$
Disallow /*order%3D*$
Disallow /*price%3D*$
Disallow /*?SID=
Disallow /*?gclid=EAIaIQobChMI95KCw5_A_QIVhgiLCh1WZgCDEAAYASAAEgJgBvD_BwE
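
To see how overlapping Allow and Disallow rules such as these resolve, the following is a minimal sketch of the longest-match convention described in RFC 9309: the matching rule with the longest path wins, and a tie goes to Allow. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical, only a subset of the rules above is included, and percent-encoding normalization is left out, so real crawlers may behave differently.

```python
import re

# A subset of the rules listed above for the "*" group, as (directive, path) pairs.
RULES = [
    ("allow", "/"),
    ("disallow", "/blackhole/"),
    ("disallow", "/checkout/"),
    ("allow", "/media/catalog/product"),
    ("allow", "/*?p="),
    ("disallow", "/*?p=*&"),
    ("disallow", "/*.php$"),
    ("disallow", "/*?SID="),
]

def pattern_to_regex(path):
    """Translate a robots.txt path pattern ('*' wildcard, trailing '$' anchor) to a regex."""
    anchored = path.endswith("$")
    if anchored:
        path = path[:-1]
    # Escape each literal piece, then join the pieces with '.*' for every '*'.
    body = ".*".join(re.escape(part) for part in path.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_allowed(url_path, rules=RULES):
    """Return True if url_path (path plus query string) may be crawled."""
    best_len = -1
    best_allow = True  # no matching rule means the path is allowed
    for directive, path in rules:
        if pattern_to_regex(path).match(url_path):
            allow = directive == "allow"
            # Longer pattern wins; on equal length, prefer Allow.
            if len(path) > best_len or (len(path) == best_len and allow and not best_allow):
                best_len, best_allow = len(path), allow
    return best_allow

print(is_allowed("/rings?p=2"))          # True
print(is_allowed("/rings?p=2&dir=asc"))  # False
print(is_allowed("/checkout/cart/"))     # False
```

In this sketch a plain pagination URL stays crawlable because `Allow /*?p=` is longer than `Allow /`, while adding a second query parameter brings the longer `Disallow /*?p=*&` into play.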

Comments

  • Begin UA Bans
  • End UA Bans
  • Crawlers Setup
  • Allow all to Nosto crawler
  • Allow Facebook
  • Blackholes to kill those who fail to honour robots.txt
  • and Sand Traps for email addresses scrapers
  • Allowable Index
  • Allowable images so you will show in image search and shopping networks
  • Directories
  • Paths (clean URLs)
  • Paths (no clean URLs)

Warnings

  • 2 invalid lines.