blog.sprucehealth.com
robots.txt
Robots Exclusion Standard data for blog.sprucehealth.com
Resource Scan
Scan Details
Site Domain | blog.sprucehealth.com |
Base Domain | sprucehealth.com |
Scan Status | Ok |
Last Scan | 2024-05-31T16:31:37+00:00 |
Next Scan | 2024-06-30T16:31:37+00:00 |
Last Scan
Scanned | 2024-05-31T16:31:37+00:00 |
URL | https://blog.sprucehealth.com/robots.txt |
Redirect | https://sprucehealth.com:443/robots.txt |
Redirect Domain | sprucehealth.com |
Redirect Base | sprucehealth.com |
Domain IPs | 18.208.233.163, 3.209.128.59, 3.228.135.46 |
Redirect IPs | 18.161.180.100, 18.161.180.37, 18.161.180.48, 18.161.180.79 |
Response IP | 108.156.133.25 |
Found | Yes |
Hash | 3485fa8f061520eb7bc0a2aa3cb550fe367b70461097b6fb9ad76bce9401e68c |
SimHash | 4901bef40eb0 |
Groups
*
Rule | Path |
---|---|
Disallow | /spruce-health-webinar/ |
Disallow | /spruce-health-webinar-3-x/ |
Disallow | /e911/ |
Disallow | /blog-rc/ |
Disallow | /sem1/ |
Disallow | /blog/wp-admin/ |
Disallow | /mh/ |
Allow | /blog/wp-admin/admin-ajax.php |
Other Records
Field | Value |
---|---|
sitemap | https://sprucehealth.com/sitemap.xml |
sitemap | https://sprucehealth.com/blog/sitemap_index.xml |