channel4.co.uk
robots.txt

Robots Exclusion Standard data for channel4.co.uk

Resource Scan

Scan Details

Site Domain channel4.co.uk
Base Domain channel4.co.uk
Scan Status Ok
Last Scan 2024-11-09T13:27:36+00:00
Next Scan 2024-11-16T13:27:36+00:00
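
The two timestamps above are exactly seven days apart, which is consistent with a weekly rescan. The report does not state the schedule, so treat the weekly interval as an inference. A minimal Python check of the arithmetic, with variable names chosen here for illustration:

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details above (ISO 8601 with UTC offset).
last_scan = datetime.fromisoformat("2024-11-09T13:27:36+00:00")
next_scan = datetime.fromisoformat("2024-11-16T13:27:36+00:00")

# Difference between the two scans; a weekly schedule is an inference,
# not something the report states.
interval = next_scan - last_scan
print(interval.days)
```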

Last Scan

Scanned 2024-11-09T13:27:36+00:00
URL http://channel4.co.uk/robots.txt
Redirect https://www.channel4.com/robots.txt
Redirect Domain www.channel4.com
Redirect Base channel4.com
Domain IPs 54.217.217.1, 54.72.208.94
Redirect IPs 23.207.182.67
Response IP 23.54.57.184
Found Yes
Hash e8d33369a253562578e768334518d47bf9893a1088c21a24c1f961035d113094
SimHash e9015c70c333

Groups

turnitinbot

Rule Path
Disallow /

*

Rule Path
Disallow /news/?*
Disallow /news/*/?*
Disallow /press/unregistered-image-search?*
Disallow /press/content-search?*
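
As a rough illustration of how the two groups above might be applied, the sketch below checks a request path against them in Python. It makes several assumptions. The `GROUPS` dictionary is a reconstruction from this report, not the raw robots.txt file. The names `pattern_to_regex` and `is_disallowed` are hypothetical. User-agent selection is simplified to an exact, case-insensitive token match with a fallback to `*`. Because only Disallow rules are listed, no Allow/Disallow precedence is needed: any matching rule blocks the path.

```python
import re

# Groups as listed in the report. The raw robots.txt is not shown there,
# so this is a reconstruction and omits the one invalid line.
GROUPS = {
    "turnitinbot": ["/"],
    "*": [
        "/news/?*",
        "/news/*/?*",
        "/press/unregistered-image-search?*",
        "/press/content-search?*",
    ],
}


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a regex.

    '*' matches any run of characters and a trailing '$' anchors the end;
    everything else, including '?', is treated literally.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(user_agent: str, path: str) -> bool:
    """Return True if any Disallow rule in the selected group matches.

    Simplification: the group is picked by exact token, else '*'.
    re.match anchors at the start of the path, as prefix matching requires.
    """
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    return any(pattern_to_regex(rule).match(path) for rule in rules)


print(is_disallowed("examplebot", "/news/?page=2"))
print(is_disallowed("examplebot", "/news/"))
```

Under these assumptions, the `*` group only blocks URLs that carry a query string under `/news/` and the two `/press/` search pages, while plain paths such as `/news/` stay crawlable. `examplebot` is a placeholder crawler name, not one taken from the report.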

Other Records

Field Value
sitemap https://www.channel4.com/news/sitemap.xml
sitemap https://www.channel4.com/sitemap.xml

Warnings

  • 1 invalid line.