hoarient.com
robots.txt

Robots Exclusion Standard data for hoarient.com

Resource Scan

Scan Details

Site Domain hoarient.com
Base Domain hoarient.com
Scan Status Ok
Last Scan2025-11-01T03:04:44+00:00
Next Scan 2025-12-01T03:04:44+00:00

Last Scan

Scanned2025-11-01T03:04:44+00:00
URL https://hoarient.com/robots.txt
Domain IPs 118.69.80.54
Response IP 118.69.80.54
Found Yes
Hash bf457e3be71b344e3a009ce7d30f714959a94b3df41c8cdc3a51120a165305cd
SimHash af15de4ad6c0

Groups

*

Rule Path
Disallow /admin
Disallow /cart
Disallow /carts
Disallow /orders
Disallow /checkout
Disallow /checkouts
Disallow /account
Disallow /collections/*%2B*
Disallow /collections/*%2B*
Disallow /collections/*%2B*
Disallow /blogs/*%2B*
Disallow /blogs/*%2B*
Disallow /blogs/*%2B*
Disallow /search
Disallow /discount/*
Disallow /apple-app-site-association

adsbot-google

Rule Path
Disallow /checkout
Disallow /checkouts
Disallow /carts
Disallow /orders
Disallow /discount/*

nutch

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /admin
Disallow /cart
Disallow /carts
Disallow /orders
Disallow /checkout
Disallow /checkouts
Disallow /account
Disallow /collections/*%2B*
Disallow /collections/*%2B*
Disallow /collections/*%2B*
Disallow /blogs/*%2B*
Disallow /blogs/*%2B*
Disallow /blogs/*%2B*
Disallow /search
Disallow /discount/*
Disallow /apple-app-site-association

Other Records

Field Value
crawl-delay 10

ahrefssiteaudit

Rule Path
Disallow /admin
Disallow /cart
Disallow /carts
Disallow /orders
Disallow /checkout
Disallow /checkouts
Disallow /account
Disallow /collections/*%2B*
Disallow /collections/*%2B*
Disallow /collections/*%2B*
Disallow /blogs/*%2B*
Disallow /blogs/*%2B*
Disallow /blogs/*%2B*
Disallow /search
Disallow /discount/*
Disallow /apple-app-site-association

Other Records

Field Value
crawl-delay 10

mj12bot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

pinterest

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://hoarient.com/sitemap.xml
sitemap https://hoarient.com/sitemap.xml
sitemap https://hoarient.com/sitemap.xml

Comments

  • we use Haravan as our ecommerce platform
  • Google adsbot ignores robots.txt unless specifically named!