Robots Exclusion Standard data for thrive.uk.com

Resource Scan

Scan Details

Site Domain thrive.uk.com
Base Domain thrive.uk.com
Scan Status Ok
Last Scan 2025-12-14T08:36:28+00:00
Next Scan 2026-01-13T08:36:28+00:00

Last Scan

Scanned 2025-12-14T08:36:28+00:00
URL https://thrive.uk.com/robots.txt
Domain IPs 199.60.103.121, 199.60.103.21
Response IP 199.60.103.121
Found Yes
Hash 4b279339a558cafe2bcee8603851eb88b3608c75e0987ad0106fdb1409f4ed66
SimHash 2271cea0cdb3
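
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest, although the report does not name the algorithm or say exactly which bytes were hashed. Assuming it is SHA-256 over the raw response body, a later copy of the file could be compared with the recorded value roughly as follows. The function names sha256_hex and matches_recorded are hypothetical, chosen for illustration only.

```python
import hashlib

# Hash value copied from the scan report above.
RECORDED_HASH = "4b279339a558cafe2bcee8603851eb88b3608c75e0987ad0106fdb1409f4ed66"


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of the given bytes as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def matches_recorded(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """True if the body hashes to the recorded value.

    Assumption: the scanner hashed the unmodified response body with SHA-256.
    If it normalised line endings or used another algorithm, this will not match.
    """
    return sha256_hex(body) == recorded.lower()
```

A mismatch would only indicate that the bytes differ under this assumption; it would not say which rules changed.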

Groups

*

Rule Path
Disallow /sample-*
Disallow /blog/sample-*
Disallow */blog/author/*
Disallow */blog/page/*
Disallow */blog/tag/*
Disallow /_hcms/preview/
Disallow /hs/manage-preferences/
Disallow /hs/preferences-center/
Disallow /*?*hs_preview=*
Disallow /*?*hsCacheBuster=*
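
As a rough illustration of how the rules above apply to request paths, the sketch below follows the common wildcard interpretation (RFC 9309 and major crawlers): a pattern matches from the start of the path, "*" stands for any run of characters, and a trailing "$" anchors the end. The helper names pattern_to_regex and is_disallowed are hypothetical, and the sample paths in the usage note are invented for the example. Because this group has no Allow rules, longest-match precedence is not needed here.

```python
import re

# Disallow rules for the "*" group, copied from the listing above.
DISALLOW = [
    "/sample-*",
    "/blog/sample-*",
    "*/blog/author/*",
    "*/blog/page/*",
    "*/blog/tag/*",
    "/_hcms/preview/",
    "/hs/manage-preferences/",
    "/hs/preferences-center/",
    "/*?*hs_preview=*",
    "/*?*hsCacheBuster=*",
]


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into an anchored regular expression."""
    anchored_end = pattern.endswith("$")
    if anchored_end:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" where "*" appeared.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored_end else ""))


def is_disallowed(path: str) -> bool:
    """True if any Disallow rule matches the path (including its query string)."""
    return any(pattern_to_regex(rule).match(path) for rule in DISALLOW)
```

Under these assumptions, is_disallowed("/en/blog/tag/news") and is_disallowed("/page?hs_preview=abc") are both True, while is_disallowed("/blog/my-post") is False. A real crawler may differ in details such as percent-encoding and case handling.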

Other Records

Field Value
sitemap https://thrive.uk.com/sitemap.xml