channel4.com
robots.txt

Robots Exclusion Standard data for channel4.com

Resource Scan

Scan Details

Site Domain channel4.com
Base Domain channel4.com
Scan Status Ok
Last Scan2024-05-27T23:32:23+00:00
Next Scan 2024-06-03T23:32:23+00:00

Last Scan

Scanned2024-05-27T23:32:23+00:00
URL https://channel4.com/robots.txt
Redirect https://www.channel4.com/robots.txt
Redirect Domain www.channel4.com
Redirect Base channel4.com
Domain IPs 54.78.179.192, 63.34.215.152
Redirect IPs 23.53.217.24
Response IP 23.54.57.184
Found Yes
Hash e8d33369a253562578e768334518d47bf9893a1088c21a24c1f961035d113094
SimHash e9015c70c333

Groups

turnitinbot

Rule Path
Disallow /

*

Rule Path
Disallow /news/?*
Disallow /news/*/?*
Disallow /press/unregistered-image-search?*
Disallow /press/content-search?*

Other Records

Field Value
sitemap https://www.channel4.com/news/sitemap.xml
sitemap https://www.channel4.com/sitemap.xml

Warnings

  • 1 invalid line.