e4.com
robots.txt

Robots Exclusion Standard data for e4.com

Resource Scan

Scan Details

Site Domain e4.com
Base Domain e4.com
Scan Status Ok
Last Scan2024-11-13T15:57:13+00:00
Next Scan 2024-11-20T15:57:13+00:00

Last Scan

Scanned2024-11-13T15:57:13+00:00
URL http://e4.com/robots.txt
Redirect https://www.channel4.com/robots.txt
Redirect Domain www.channel4.com
Redirect Base channel4.com
Domain IPs 34.251.236.254, 54.72.208.94
Redirect IPs 23.36.50.42
Response IP 23.53.162.20
Found Yes
Hash e8d33369a253562578e768334518d47bf9893a1088c21a24c1f961035d113094
SimHash e9015c70c333

Groups

turnitinbot

Rule Path
Disallow /

*

Rule Path
Disallow /news/?*
Disallow /news/*/?*
Disallow /press/unregistered-image-search?*
Disallow /press/content-search?*

Other Records

Field Value
sitemap https://www.channel4.com/news/sitemap.xml
sitemap https://www.channel4.com/sitemap.xml

Warnings

  • 1 invalid line.