growershouse.com
robots.txt

Robots Exclusion Standard data for growershouse.com

Resource Scan

Scan Details

Site Domain growershouse.com
Base Domain growershouse.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-08-07T03:10:56+00:00
Next Scan 2024-11-05T03:10:56+00:00

Last Successful Scan

Scanned2023-07-15T03:05:03+00:00
URL https://growershouse.com/robots.txt
Domain IPs 104.22.48.150, 104.22.49.150, 172.67.11.128, 2606:4700:10::6816:3096, 2606:4700:10::6816:3196, 2606:4700:10::ac43:b80
Response IP 104.22.49.150
Found Yes
Hash f73e138614143817ccef6436c5ab365809aae40f87c59e13b9cd71158cc4aa32
SimHash cb34f852e353

Groups

googlebot-image

Rule Path
Allow /media/catalog/product/image/*/*.jpg$
Allow /media/catalog/product/image/*/*.png$
Allow /media/catalog/product/image/*/*.gif$
Disallow /media/catalog/product/image/*/*.jpg/*
Disallow /media/catalog/product/image/*/*.png/*
Disallow /media/catalog/product/image/*/*.gif/*

*

Rule Path
Disallow /index.php/
Disallow /*?
Disallow /checkout/
Disallow /app/
Disallow /lib/
Disallow /*.php$
Disallow /pkginfo/
Disallow /report/
Disallow /var/
Disallow /catalog/
Disallow /customer/
Disallow /sendfriend/
Disallow /review/
Disallow /*SID%3D
Disallow /home-newt-61f0746ec53dd/
Disallow /catalogsearch/
Disallow /catalog/product_compare/
Disallow /catalog/category/view/
Disallow /catalog/product/view/
Disallow /*?dir*
Disallow /*?dir=desc
Disallow /*?dir=asc
Disallow /*?limit=all
Disallow /?modes
Disallow /*?
Disallow /catalogsearch/result*?

blexbot
mj12bot
awariorssbot
awariosmartbot
sentibot
siteauditbot
ahrefsbot
ccbot
infotigerbot
panscient.com

Rule Path
Disallow /

Other Records

Field Value
sitemap https://growershouse.com/media/sitemap.xml

Comments

  • Allow product images without query params
  • Disallow product images without query params
  • Restrict Catalog Search Page
  • Disallow URL Filter Searches