simplygoodstuff.com
robots.txt

Robots Exclusion Standard data for simplygoodstuff.com

Resource Scan

Scan Details

Site Domain simplygoodstuff.com
Base Domain simplygoodstuff.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-12-11T05:16:20+00:00
Next Scan 2026-03-11T05:16:20+00:00

Last Successful Scan

Scanned 2023-08-22T07:24:28+00:00
URL https://simplygoodstuff.com/robots.txt
Domain IPs 104.19.177.121, 104.19.178.121
Response IP 104.19.177.121
Found Yes
Hash 82a463fb571446808fc69512a788a17a1d03b44c35ff3628f9c63e695296a4fa
SimHash 2c956ed2ebfe

Groups

*

Rule Path
Disallow /checkout.asp
Disallow /add_cart.asp
Disallow /view_cart.asp
Disallow /error.asp
Disallow /shipquote.asp
Disallow /rssfeed.asp
Disallow /mobile/
Disallow /blog.asp
Disallow /fixedurl
Disallow /admin/

googlebot

Rule Path
Disallow /checkout.asp
Disallow /add_cart.asp
Disallow /view_cart.asp
Disallow /error.asp
Disallow /shipquote.asp
Disallow /rssfeed.asp

googlebot-image

Rule Path
Disallow /checkout.asp
Disallow /add_cart.asp
Disallow /view_cart.asp
Disallow /error.asp
Disallow /shipquote.asp
Disallow /rssfeed.asp

Other Records

Field Value
sitemap https://www.simplygoodstuff.com/sitemap.xml

Comments

  • Disallow all crawlers access to certain pages.
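
The groups above can be checked with Python's standard urllib.robotparser. This is a minimal sketch, not the scanner's own logic: the ROBOTS_TXT string is an assumed reconstruction from the listed groups (the real file's casing, ordering, and comment text may differ), and build_parser, is_allowed, and the agent name "examplebot" are hypothetical names used only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the file from the groups listed in this report;
# the original layout is not known.
ROBOTS_TXT = """\
User-agent: *
Disallow: /checkout.asp
Disallow: /add_cart.asp
Disallow: /view_cart.asp
Disallow: /error.asp
Disallow: /shipquote.asp
Disallow: /rssfeed.asp
Disallow: /mobile/
Disallow: /blog.asp
Disallow: /fixedurl
Disallow: /admin/

User-agent: googlebot
Disallow: /checkout.asp
Disallow: /add_cart.asp
Disallow: /view_cart.asp
Disallow: /error.asp
Disallow: /shipquote.asp
Disallow: /rssfeed.asp

User-agent: googlebot-image
Disallow: /checkout.asp
Disallow: /add_cart.asp
Disallow: /view_cart.asp
Disallow: /error.asp
Disallow: /shipquote.asp
Disallow: /rssfeed.asp

Sitemap: https://www.simplygoodstuff.com/sitemap.xml
"""


def build_parser(text: str = ROBOTS_TXT) -> RobotFileParser:
    """Parse robots.txt text held in memory (no network request)."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


def is_allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the parsed rules."""
    url = "https://simplygoodstuff.com" + path
    return build_parser().can_fetch(agent, url)


if __name__ == "__main__":
    # "examplebot" has no group of its own, so it falls back to the * group.
    for agent in ("examplebot", "googlebot"):
        for path in ("/checkout.asp", "/mobile/", "/"):
            print(agent, path, is_allowed(agent, path))
```

Because the googlebot group omits /mobile/, /blog.asp, /fixedurl, and /admin/, a crawler matching that group is not restricted from those paths by this file, while any other crawler falls under the * group and is. Note that urllib.robotparser matches rules by simple path prefix, so its results may differ from a given crawler's own matching in edge cases.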