acrowonlinestore.com
robots.txt

Robots Exclusion Standard data for acrowonlinestore.com

Resource Scan

Scan Details

Site Domain acrowonlinestore.com
Base Domain acrowonlinestore.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-02-18T21:17:00+00:00
Next Scan 2025-04-19T21:17:00+00:00

Last Successful Scan

Scanned 2024-12-14T18:09:07+00:00
URL https://acrowonlinestore.com/robots.txt
Redirect https://www.acrowonlinestore.com/robots.txt
Redirect Domain www.acrowonlinestore.com
Redirect Base acrowonlinestore.com
Domain IPs 104.21.91.26, 172.67.208.219, 2606:4700:3032::ac43:d0db, 2606:4700:3034::6815:5b1a
Redirect IPs 104.18.4.212, 104.18.5.212, 2606:4700::6810:2568, 2606:4700::6810:3568
Response IP 104.18.4.212
Found Yes
Hash fc2855b6879ebf514f5d8ad36c61049f4f2246d1fe8d3bf3acfc5019d142d3f1
SimHash 8a410ad7e073

Groups

*

Rule Path
Disallow /client/
Disallow /banners/
Disallow /administration/
Disallow /adverts/search_box/
Disallow /adverts/phone_num/
Disallow /bikes/contact/
Disallow /advert/contact/
Disallow /baby_kids_toddler/contact/
Disallow /admin
Disallow /api
Disallow /carts
Disallow /carts/add_variant
Disallow /subscribe/thank_you
Disallow /orders/thank_you
Disallow /competition/thanks
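
The rules in the `*` group are path prefixes, so a rule such as `/admin` also covers `/administration/` and anything else that starts with the same characters. As a minimal sketch, the group can be checked with Python's standard `urllib.robotparser`. The robots.txt text here is an assumption: it is rebuilt from the parsed listing above, not copied from the site's actual file. The helper `allowed`, the agent name `examplebot`, and the sample paths used with it are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumption: rebuilt from the parsed "*" group in this report,
# not the verbatim robots.txt served by the site.
WILDCARD_GROUP = """\
User-agent: *
Disallow: /client/
Disallow: /banners/
Disallow: /administration/
Disallow: /adverts/search_box/
Disallow: /adverts/phone_num/
Disallow: /bikes/contact/
Disallow: /advert/contact/
Disallow: /baby_kids_toddler/contact/
Disallow: /admin
Disallow: /api
Disallow: /carts
Disallow: /carts/add_variant
Disallow: /subscribe/thank_you
Disallow: /orders/thank_you
Disallow: /competition/thanks
"""

BASE = "https://www.acrowonlinestore.com"

parser = RobotFileParser()
parser.parse(WILDCARD_GROUP.splitlines())


def allowed(path, agent="examplebot"):
    """Return True if `agent` may fetch `path` under the rules above.

    `examplebot` is a hypothetical agent with no group of its own,
    so it falls back to the "*" group.
    """
    return parser.can_fetch(agent, BASE + path)
```

For example, `allowed("/carts/add_variant")` and `allowed("/admin-tools")` are both refused by prefix match, while a path not listed in the group, such as the hypothetical `/products/123`, is permitted. Note that `/carts/add_variant` is already covered by the shorter `/carts` rule.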

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

rogerbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

facebookbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5
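
The three groups above set only a `crawl-delay` and no path rules, so these agents may fetch any path but are asked to wait between requests. A minimal sketch of reading those values with Python's standard `urllib.robotparser` follows. The robots.txt text is an assumption rebuilt from the listing above, and `delay_for` and the agent name `examplebot` are hypothetical. `crawl-delay` is a non-standard field, and whether a given crawler honours it is up to that crawler.

```python
from urllib.robotparser import RobotFileParser

# Assumption: rebuilt from the parsed groups in this report,
# not the verbatim robots.txt served by the site.
CRAWL_DELAY_GROUPS = """\
User-agent: mj12bot
Crawl-delay: 5

User-agent: rogerbot
Crawl-delay: 10

User-agent: facebookbot
Crawl-delay: 5
"""

parser = RobotFileParser()
parser.parse(CRAWL_DELAY_GROUPS.splitlines())


def delay_for(agent, default=0.0):
    """Return the crawl delay in seconds for `agent`.

    Falls back to `default` when no matching group declares a delay.
    """
    delay = parser.crawl_delay(agent)
    return default if delay is None else float(delay)
```

A polite crawler could call `time.sleep(delay_for("rogerbot"))` between requests. An agent without a matching group, such as the hypothetical `examplebot`, gets the default.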

yandex

Rule Path
Disallow /

dealgates bot

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

meanpathbot

Rule Path
Disallow /

wise-guys

Rule Path
Disallow /
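
A group whose only rule is `Disallow /` shuts the named agent out of the whole site, and a named group takes precedence over the `*` group for that agent. The sketch below illustrates this with Python's standard `urllib.robotparser`. The robots.txt text is an assumption rebuilt from the listing above, with the `*` group shortened to one rule and the `dealgates bot` group left out for brevity. `may_crawl` and the agent name `examplebot` are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# Assumption: rebuilt from the parsed groups in this report; the "*" group
# is abbreviated to a single rule and "dealgates bot" is omitted.
BLOCKED_GROUPS = """\
User-agent: *
Disallow: /client/

User-agent: yandex
Disallow: /

User-agent: turnitinbot
Disallow: /

User-agent: meanpathbot
Disallow: /

User-agent: wise-guys
Disallow: /
"""

HOME = "https://www.acrowonlinestore.com/"

parser = RobotFileParser()
parser.parse(BLOCKED_GROUPS.splitlines())


def may_crawl(agent, url=HOME):
    """Return True if `agent` may fetch `url` under the groups above."""
    return parser.can_fetch(agent, url)
```

Here `may_crawl("YandexBot")` is refused because Python's parser matches the group name `yandex` as a case-insensitive substring of the agent token, while the hypothetical `examplebot` falls through to the `*` group and is only kept out of `/client/`.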

Other Records

Field Value
sitemap https://www.acrowonlinestore.com/sitemap.xml.gz

Comments

  • See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file