homchoo.com
robots.txt

Robots Exclusion Standard data for homchoo.com

Resource Scan

Scan Details

Site Domain homchoo.com
Base Domain homchoo.com
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2024-03-22T17:05:53+00:00
Next Scan 2024-06-20T17:05:53+00:00

Last Successful Scan

Scanned2023-08-03T10:43:14+00:00
URL https://homchoo.com/robots.txt
Redirect https://www.japhneshop.com/robots.txt
Redirect Domain www.japhneshop.com
Redirect Base japhneshop.com
Domain IPs 104.21.89.86, 172.67.157.97, 2606:4700:3033::ac43:9d61, 2606:4700:3036::6815:5956
Redirect IPs 54.192.150.101, 54.192.150.13, 54.192.150.71, 54.192.150.92
Response IP 54.192.150.71
Found Yes
Hash d4816305773954fa08f77e20134dc9a9b21796d314f9c1f480107782de99e185
SimHash 815c9f11cdd6

Groups

mj12bot

Rule Path
Disallow /

hubspot

Rule Path
Disallow /

*

Rule Path
Disallow /closed
Disallow /preview/
Disallow /users/
Disallow /orders
Disallow /*?*debug=*
Disallow /*?*theme_preview=*
Disallow /*?*price_range_preview=*
Disallow /*?*draft=*
Disallow /api/
Disallow /themes/

Other Records

Field Value
sitemap https://www.japhneshop.com/sitemap.xml

Comments

  • robots.txt file for Shopline Merchant
  • split user-agent disallows for different bots, as not all bots may follow google's multi-user-agent standard
  • Allow crawling of all content except