healthtimes.co.uk
robots.txt

Robots Exclusion Standard data for healthtimes.co.uk

Resource Scan

Scan Details

Site Domain healthtimes.co.uk
Base Domain healthtimes.co.uk
Scan Status Ok
Last Scan2024-09-18T22:23:07+00:00
Next Scan 2024-09-25T22:23:07+00:00

Last Scan

Scanned2024-09-18T22:23:07+00:00
URL https://healthtimes.co.uk/robots.txt
Redirect https://www.healthtimes.co.uk/robots.txt
Redirect Domain www.healthtimes.co.uk
Redirect Base healthtimes.co.uk
Domain IPs 104.21.76.104, 172.67.193.14, 2606:4700:3036::ac43:c10e, 2606:4700:3037::6815:4c68
Redirect IPs 104.21.76.104, 172.67.193.14, 2606:4700:3036::ac43:c10e, 2606:4700:3037::6815:4c68
Response IP 104.21.76.104
Found Yes
Hash e3bb706b22e4e5e5290aa7049efc28c46d0d68a3733d4a9778e7fb7e75fe2930
SimHash e615bb4187e2

Groups

datocmssearchbot

Rule Path
Disallow */amp
Disallow /amp
Disallow /authors/*
Disallow /website-terms-of-use
Disallow /website-privacy-policy
Disallow /cookies
Disallow /advertise-with-us
Disallow /search
Disallow /notice-takedown-policy
Disallow /research-policy

*

Rule Path
Allow /
Disallow /**/*-agtestag
Disallow /*page-data.json
Disallow /intermediate
Disallow /search

Other Records

Field Value
sitemap https://www.healthtimes.co.uk/sitemap-0.xml

Warnings

  • `host` is not a known field.