Robots Exclusion Standard data for thecourier.co.uk
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | thecourier.co.uk |
Base Domain | thecourier.co.uk |
Scan Status | Ok |
Last Scan | 2024-05-20T11:41:30+00:00 |
Next Scan | 2024-05-27T11:41:30+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-05-20T11:41:30+00:00 |
URL | https://thecourier.co.uk/robots.txt |
Redirect | https://www.thecourier.co.uk/robots.txt |
Redirect Domain | www.thecourier.co.uk |
Redirect Base | thecourier.co.uk |
Domain IPs | 89.106.200.1 |
Redirect IPs | 104.18.28.20, 104.18.29.20, 2606:4700::6812:1c14, 2606:4700::6812:1d14 |
Response IP | 104.18.28.20 |
Found | Yes |
Hash | b12e8957a3da268b9189039c1cfe5d2b080fabc945362fde53b0b2fde654d294 |
SimHash | 18675840b513 |
Groups
*
Rule | Path |
---|---|
Disallow | /wp-admin* |
Disallow | *s%3Dfeed |
Disallow | */?s&%3B* |
Disallow | */?s=* |
Disallow | *s%3D* |
Disallow | /search/* |
Disallow | /search?q=* |
Disallow | /?filter* |
Disallow | *?share=* |
*
Rule | Path |
---|---|
Disallow |
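The Disallow paths above rely on the `*` wildcard, so a plain prefix comparison is not enough to tell whether a URL is blocked. The following is a minimal sketch of how the rules for the `*` user agent might be evaluated. It assumes the common wildcard semantics (`*` matches any run of characters, a trailing `$` anchors the end, matching starts at the beginning of the path and query). The names `rule_to_regex` and `is_disallowed` are hypothetical helpers, not part of any scanner. Percent-encoding normalisation and Allow precedence are ignored, and the empty Disallow from the second group is left out because it blocks nothing.

```python
import re

# Disallow paths copied from the first "*" group in the table above.
# The empty Disallow in the second group imposes no restriction, so it is omitted.
DISALLOW = [
    "/wp-admin*",
    "*s%3Dfeed",
    "*/?s&%3B*",
    "*/?s=*",
    "*s%3D*",
    "/search/*",
    "/search?q=*",
    "/?filter*",
    "*?share=*",
]


def rule_to_regex(pattern: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any sequence of characters and a trailing '$'
    anchors the match at the end of the URL path and query.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path_and_query: str) -> bool:
    """Return True if any Disallow rule matches from the start of the path."""
    return any(rule_to_regex(p).match(path_and_query) for p in DISALLOW)


if __name__ == "__main__":
    for candidate in ["/wp-admin/index.php", "/news/?s=football", "/news/dundee/"]:
        print(candidate, is_disallowed(candidate))
```

The patterns are matched literally, so a rule such as `*s%3D*` only matches URLs that carry the encoded form `%3D`; a production crawler would normalise percent-encoding before comparing.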
Other Records
Field | Value |
---|---|
sitemap | https://www.thecourier.co.uk/news-sitemap.xml |
sitemap | https://www.thecourier.co.uk/sitemap.xml |