dossier.co
robots.txt

Robots Exclusion Standard data for dossier.co

Resource Scan

Scan Details

Site Domain dossier.co
Base Domain dossier.co
Scan Status Ok
Last Scan2024-10-03T07:44:31+00:00
Next Scan 2024-10-17T07:44:31+00:00

Last Scan

Scanned2024-10-03T07:44:31+00:00
URL https://dossier.co/robots.txt
Domain IPs 23.227.38.32
Response IP 23.227.38.32
Found Yes
Hash 93cf19ab96a143b8473ab6032d17d54a75d931f20ffe8b61942557fa21b1a978
SimHash e5fd5e6371c4

Groups

*

Rule Path
Disallow /admin
Disallow /cart
Disallow /orders
Disallow /checkouts/
Disallow /checkout
Disallow /carts
Disallow /account
Disallow /*?ref=*
Disallow /*search?q=*
Disallow *%7Bsearch_term%7D*
Disallow /*?direction=next&cursor=*search%3Fq=*
Disallow /*?direction=prev&cursor=*search%3Fq=*
Disallow *?yoReviews*
Disallow *?gf_*
Disallow *?yoReviews*
Disallow *?gf_*

adsbot-google

Rule Path
Disallow /checkouts/
Disallow /checkout
Disallow /carts
Disallow /orders

pinterest

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://dossier.co/sitemap.xml

Comments

  • Google adsbot ignores robots.txt unless specifically named!