e4.com
robots.txt
Robots Exclusion Standard data for e4.com
Resource Scan
Scan Details
Site Domain | e4.com |
Base Domain | e4.com |
Scan Status | Ok |
Last Scan | 2024-11-13T15:57:13+00:00 |
Next Scan | 2024-11-20T15:57:13+00:00 |
Last Scan
Scanned | 2024-11-13T15:57:13+00:00 |
URL | http://e4.com/robots.txt |
Redirect | https://www.channel4.com/robots.txt |
Redirect Domain | www.channel4.com |
Redirect Base | channel4.com |
Domain IPs | 34.251.236.254, 54.72.208.94 |
Redirect IPs | 23.36.50.42 |
Response IP | 23.53.162.20 |
Found | Yes |
Hash | e8d33369a253562578e768334518d47bf9893a1088c21a24c1f961035d113094 |
SimHash | e9015c70c333 |
Groups
*
Rule | Path |
---|---|
Disallow | /news/?* |
Disallow | /news/*/?* |
Disallow | /press/unregistered-image-search?* |
Disallow | /press/content-search?* |
Other Records
Field | Value |
---|---|
sitemap | https://www.channel4.com/news/sitemap.xml |
sitemap | https://www.channel4.com/sitemap.xml |
Warnings
- 1 invalid line.