goodsearch.org
robots.txt

Robots Exclusion Standard data for goodsearch.org

Resource Scan

Scan Details

Site Domain goodsearch.org
Base Domain goodsearch.org
Scan Status Ok
Last Scan 2024-05-21T03:36:57+00:00
Next Scan 2024-05-28T03:36:57+00:00

Last Scan

Scanned 2024-05-21T03:36:57+00:00
URL https://goodsearch.org/robots.txt
Redirect https://www.goodshop.com/robots.txt
Redirect Domain www.goodshop.com
Redirect Base goodshop.com
Domain IPs 104.21.72.149, 172.67.151.100, 2606:4700:3032::6815:4895, 2606:4700:3035::ac43:9764
Redirect IPs 104.20.85.30, 104.20.86.30, 172.67.0.240, 2606:4700:10::6814:551e, 2606:4700:10::6814:561e, 2606:4700:10::ac43:f0
Response IP 104.20.85.30
Found Yes
Hash 27b0d9d1eccca138e44547cc8a5f5afcbbf8354b53a0e10cde69bc158704e858
SimHash e52ddc446912

Groups

*

Rule Path
Disallow /d/
Disallow /raise-money-with-goodshop
Disallow /track/get-user-ids.json
Disallow /deals/
Disallow /m?*
Disallow /m/
Disallow /partners/
Disallow /nonprofit/
Disallow /toolbar/
Disallow /cloudinary/
Disallow /user/
Disallow /sso
Disallow /login
Disallow /register
Disallow /give/login
Disallow /give/signup
Disallow /give/new
Disallow /causes/search
Disallow /amp_access/login
Disallow /amp_access/register
Disallow /*?charityid
Disallow /*?page
Disallow /wp-admin
Disallow /wp-admin/
Disallow /wp-login.php
Disallow /wp-trackback.php
Disallow /xmlrpc.php
Disallow /*%26utm_source
Disallow /*%26utm_medium
Disallow /*%26utm_campaign
Disallow /*?open
Disallow /*?search_open
Disallow /ahoy/
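
Several of the paths above use the `*` wildcard (for example `/m?*`, `/*?page`, `/*%26utm_source`), so a plain prefix comparison is not enough to decide whether a URL is blocked. The following is a minimal sketch of wildcard-aware matching in the style of RFC 9309. The names `DISALLOW`, `pattern_to_regex`, and `is_disallowed` are hypothetical, the rule list is only a subset of the rules recorded above, and the sketch assumes the group contains no Allow rules (as in this scan), so any match means the path is disallowed.

```python
import re

# Subset of the Disallow paths recorded for the "*" group above.
DISALLOW = [
    "/d/",
    "/deals/",
    "/m?*",
    "/login",
    "/*?page",
    "/*%26utm_source",
    "/wp-admin",
]

def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex anchored at the start.

    '*' matches any run of characters; a trailing '$' anchors the end of
    the path. Everything else is matched literally, as a prefix.
    """
    anchored_end = pattern.endswith("$")
    if anchored_end:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with ".*" where '*' appeared.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored_end else ""))

def is_disallowed(path, rules=DISALLOW):
    """Return True if any Disallow pattern matches the path (plus query)."""
    return any(pattern_to_regex(rule).match(path) for rule in rules)

if __name__ == "__main__":
    for candidate in ["/deals/amazon", "/m?x=1", "/coupons?page=2", "/coupons"]:
        print(candidate, is_disallowed(candidate))
```

If a group also contained Allow rules, a crawler following RFC 9309 would have to pick the most specific (longest) matching rule instead of stopping at the first Disallow match; that case is not handled here.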

Other Records

Field Value
sitemap https://www.goodshop.com/sitemap-index.xml
sitemap https://www.goodshop.com/sitemap-main.xml
sitemap https://www.goodshop.com/sitemap-merchants.xml
sitemap https://www.goodshop.com/curbside/sitemap.xml
sitemap https://www.goodshop.com/blog/post-sitemap.xml
sitemap https://www.goodshop.com/blog/page-sitemap.xml
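
The sitemap records above come from `Sitemap:` lines, which sit outside any user-agent group in a robots.txt file. As a rough sketch of how such records could be pulled out of the raw file, the function below scans each line for the field name. `extract_sitemaps` and `SAMPLE` are hypothetical, and the sample text is an assumed reconstruction containing just two of the URLs listed above, not the actual file body.

```python
from typing import List

def extract_sitemaps(robots_txt: str) -> List[str]:
    """Collect the values of all 'Sitemap:' lines, in file order."""
    urls = []
    for line in robots_txt.splitlines():
        # Drop trailing comments and surrounding whitespace.
        line = line.split("#", 1)[0].strip()
        # Split on the first colon only, so the URL's own "://" survives.
        field, sep, value = line.partition(":")
        if sep and field.strip().lower() == "sitemap":
            urls.append(value.strip())
    return urls

# Assumed sample input; only the two URLs are taken from the records above.
SAMPLE = """\
User-agent: *
Disallow: /ahoy/

Sitemap: https://www.goodshop.com/sitemap-index.xml
sitemap: https://www.goodshop.com/sitemap-main.xml
"""

if __name__ == "__main__":
    for url in extract_sitemaps(SAMPLE):
        print(url)
```

The field name is compared case-insensitively, which is why both `Sitemap:` and `sitemap:` are picked up.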