thrive.uk.com
robots.txt
Robots Exclusion Standard data for thrive.uk.com
Resource Scan
Scan Details
| Site Domain | thrive.uk.com |
| Base Domain | thrive.uk.com |
| Scan Status | Ok |
| Last Scan | 2025-12-14T08:36:28+00:00 |
| Next Scan | 2026-01-13T08:36:28+00:00 |
Last Scan
| Scanned | 2025-12-14T08:36:28+00:00 |
| URL | https://thrive.uk.com/robots.txt |
| Domain IPs | 199.60.103.121, 199.60.103.21 |
| Response IP | 199.60.103.121 |
| Found | Yes |
| Hash | 4b279339a558cafe2bcee8603851eb88b3608c75e0987ad0106fdb1409f4ed66 |
| SimHash | 2271cea0cdb3 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /sample-* |
| Disallow | /blog/sample-* |
| Disallow | */blog/author/* |
| Disallow | */blog/page/* |
| Disallow | */blog/tag/* |
| Disallow | /_hcms/preview/ |
| Disallow | /hs/manage-preferences/ |
| Disallow | /hs/preferences-center/ |
| Disallow | /*?*hs_preview=* |
| Disallow | /*?*hsCacheBuster=* |
Other Records
| Field | Value |
|---|---|
| sitemap | https://thrive.uk.com/sitemap.xml |