cycleworld.com.au
robots.txt

Robots Exclusion Standard data for cycleworld.com.au

Resource Scan

Scan Details

Site Domain cycleworld.com.au
Base Domain cycleworld.com.au
Scan Status Ok
Last Scan2024-11-09T15:23:50+00:00
Next Scan 2024-11-16T15:23:50+00:00

Last Scan

Scanned2024-11-09T15:23:50+00:00
URL https://cycleworld.com.au/robots.txt
Domain IPs 104.18.5.212
Response IP 104.18.5.212
Found Yes
Hash 3b05ef2797c89d5d4d7b6616e49fd4dbe5fdbdcb0ecfa75cb90e241823271a46
SimHash 8a410a57e773

Groups

*

Rule Path
Disallow /client/
Disallow /banners/
Disallow /administration/
Disallow /adverts/search_box/
Disallow /adverts/phone_num/
Disallow /bikes/contact/
Disallow /advert/contact/
Disallow /baby_kids_toddler/contact/
Disallow /admin
Disallow /api
Disallow /carts
Disallow /carts/add_variant
Disallow /subscribe/thank_you
Disallow /orders/thank_you
Disallow /competition/thanks

googlebot

Rule Path
Allow /api/mobile
Disallow /api/mobile/orders
Disallow /api/mobile/user

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

rogerbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

facebookbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

yandex

Rule Path
Disallow /

dealgates bot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

meanpathbot

Rule Path
Disallow /

wise-guys

Rule Path
Disallow /

Other Records

Field Value
sitemap https://cycleworld.com.au/sitemap.xml.gz

Comments

  • See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file