healthtimes.co.uk
robots.txt
Robots Exclusion Standard data for healthtimes.co.uk
Resource Scan
Scan Details
Site Domain | healthtimes.co.uk |
Base Domain | healthtimes.co.uk |
Scan Status | Ok |
Last Scan | 2024-09-18T22:23:07+00:00 |
Next Scan | 2024-09-25T22:23:07+00:00 |
Last Scan
Scanned | 2024-09-18T22:23:07+00:00 |
URL | https://healthtimes.co.uk/robots.txt |
Redirect | https://www.healthtimes.co.uk/robots.txt |
Redirect Domain | www.healthtimes.co.uk |
Redirect Base | healthtimes.co.uk |
Domain IPs | 104.21.76.104, 172.67.193.14, 2606:4700:3036::ac43:c10e, 2606:4700:3037::6815:4c68 |
Redirect IPs | 104.21.76.104, 172.67.193.14, 2606:4700:3036::ac43:c10e, 2606:4700:3037::6815:4c68 |
Response IP | 104.21.76.104 |
Found | Yes |
Hash | e3bb706b22e4e5e5290aa7049efc28c46d0d68a3733d4a9778e7fb7e75fe2930 |
SimHash | e615bb4187e2 |
Groups
datocmssearchbot
Rule | Path |
---|---|
Disallow | */amp |
Disallow | /amp |
Disallow | /authors/* |
Disallow | /website-terms-of-use |
Disallow | /website-privacy-policy |
Disallow | /cookies |
Disallow | /advertise-with-us |
Disallow | /search |
Disallow | /notice-takedown-policy |
Disallow | /research-policy |
*
Rule | Path |
---|---|
Allow | / |
Disallow | /**/*-agtestag |
Disallow | /*page-data.json |
Disallow | /intermediate |
Disallow | /search |
Other Records
Field | Value |
---|---|
sitemap | https://www.healthtimes.co.uk/sitemap-0.xml |
Warnings
- `host` is not a known field.