cyclingtips.marketplacer.com
robots.txt

Robots Exclusion Standard data for cyclingtips.marketplacer.com

Resource Scan

Scan Details

Site Domain cyclingtips.marketplacer.com
Base Domain marketplacer.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2025-03-29T17:20:41+00:00
Next Scan 2025-05-28T17:20:41+00:00

Last Successful Scan

Scanned2025-01-22T17:19:18+00:00
URL https://cyclingtips.marketplacer.com/robots.txt
Redirect https://www.cycling-emporium.com/robots.txt
Redirect Domain www.cycling-emporium.com
Redirect Base cycling-emporium.com
Domain IPs 104.17.72.119, 104.17.73.119, 2606:4700::6811:4877, 2606:4700::6811:4977
Redirect IPs 104.18.4.212, 104.18.5.212, 2606:4700::6810:2568, 2606:4700::6810:3568
Response IP 104.18.5.212
Found Yes
Hash bb11ab9cd55e90907bfff3bafd50692d730e5e16749f170733de7d5cb127ed6e
SimHash 8a419a57e073

Groups

*

Rule Path
Disallow /client/
Disallow /banners/
Disallow /administration/
Disallow /adverts/search_box/
Disallow /adverts/phone_num/
Disallow /bikes/contact/
Disallow /advert/contact/
Disallow /baby_kids_toddler/contact/
Disallow /admin
Disallow /api
Disallow /carts
Disallow /carts/add_variant
Disallow /subscribe/thank_you
Disallow /orders/thank_you

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

rogerbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

facebookbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

yandex

Rule Path
Disallow /

dealgates bot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

meanpathbot

Rule Path
Disallow /

wise-guys

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.cycling-emporium.com/sitemap.xml.gz

Comments

  • See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file