ribblecycles.co.uk
robots.txt

Robots Exclusion Standard data for ribblecycles.co.uk

Resource Scan

Scan Details

Site Domain ribblecycles.co.uk
Base Domain ribblecycles.co.uk
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-09-17T08:18:22+00:00
Next Scan 2025-12-16T08:18:22+00:00

Last Successful Scan

Scanned2024-10-30T07:26:41+00:00
URL https://ribblecycles.co.uk/robots.txt
Redirect https://checkout.ribblecycles.co.uk/robots.txt
Redirect Domain checkout.ribblecycles.co.uk
Redirect Base ribblecycles.co.uk
Domain IPs 104.26.0.124, 104.26.1.124, 172.67.71.100, 2606:4700:20::681a:17c, 2606:4700:20::681a:7c, 2606:4700:20::ac43:4764
Redirect IPs 34.142.122.118
Response IP 34.142.122.118
Found Yes
Hash 494979d915956d04a631e976010c081f532dc0f6b6caf065e336ae61c4f11b9f
SimHash 422eb9c7e637

Groups

*

Rule Path
Disallow /index.php/
Disallow /*?
Disallow /checkout/
Disallow /app/
Disallow /lib/
Disallow /*.php$
Disallow /pkginfo/
Disallow /report/
Disallow /var/
Disallow /catalog/
Disallow /customer/
Disallow /sendfriend/
Disallow /review/
Disallow /*SID%3D
Disallow *?___store
Disallow *?gisAutoCall
Disallow /amp/
Disallow /blog/tag/
Disallow /wishlist
Disallow /catalog
Disallow /catalogsearch/result/?q=
Disallow /destination
Disallow /directory
Disallow /merchandising
Disallow /checkout
Disallow /admin
Disallow /productalert
Disallow */build/
Disallow /bikes/bikebuilder/*
Disallow /dist/*

googlebot

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

mj12bot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

baiduspider

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

rogerbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://checkout.ribblecycles.co.uk/pub/media/sitemaps/www.ribblecycles.co.uk/sitemap.xml

Comments

  • BLOCK GENERAL AREAS - don't need to be crawled
  • UNWANTED BOTS - disallow
  • Throttle crawling for Baiduspider
  • Throttle crawling for Rogerbot
  • SITEMAP REFERENCES