channel4.co.uk
robots.txt
Robots Exclusion Standard data for channel4.co.uk
Resource Scan
Scan Details
Site Domain | channel4.co.uk |
Base Domain | channel4.co.uk |
Scan Status | Ok |
Last Scan | 2024-11-09T13:27:36+00:00 |
Next Scan | 2024-11-16T13:27:36+00:00 |
Last Scan
Scanned | 2024-11-09T13:27:36+00:00 |
URL | http://channel4.co.uk/robots.txt |
Redirect | https://www.channel4.com/robots.txt |
Redirect Domain | www.channel4.com |
Redirect Base | channel4.com |
Domain IPs | 54.217.217.1, 54.72.208.94 |
Redirect IPs | 23.207.182.67 |
Response IP | 23.54.57.184 |
Found | Yes |
Hash | e8d33369a253562578e768334518d47bf9893a1088c21a24c1f961035d113094 |
SimHash | e9015c70c333 |
Groups
*
Rule | Path |
---|---|
Disallow | /news/?* |
Disallow | /news/*/?* |
Disallow | /press/unregistered-image-search?* |
Disallow | /press/content-search?* |
Other Records
Field | Value |
---|---|
sitemap | https://www.channel4.com/news/sitemap.xml |
sitemap | https://www.channel4.com/sitemap.xml |
Warnings
- 1 invalid line.