dcthomson.co.uk
robots.txt
Robots Exclusion Standard data for dcthomson.co.uk
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | dcthomson.co.uk |
Base Domain | dcthomson.co.uk |
Scan Status | Ok |
Last Scan | 2024-05-30T03:23:47+00:00 |
Next Scan | 2024-06-06T03:23:47+00:00 |
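The Last Scan and Next Scan timestamps imply a rescan interval, which can be checked with a short sketch. The variable names are illustrative and not part of the scan data; the report itself does not state the schedule, so the interval is only what these two values suggest.

```python
from datetime import datetime

# Timestamps copied from the Scan Details table (ISO 8601 with UTC offset).
last_scan = datetime.fromisoformat("2024-05-30T03:23:47+00:00")
next_scan = datetime.fromisoformat("2024-06-06T03:23:47+00:00")

# Subtracting two timezone-aware datetimes yields a timedelta.
interval = next_scan - last_scan
print(interval.days)  # 7
```

For this record the two scans are exactly seven days apart.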
Last Scan
Field | Value |
---|---|
Scanned | 2024-05-30T03:23:47+00:00 |
URL | https://dcthomson.co.uk/robots.txt |
Redirect | https://www.dcthomson.co.uk/robots.txt |
Redirect Domain | www.dcthomson.co.uk |
Redirect Base | dcthomson.co.uk |
Domain IPs | 89.106.200.1 |
Redirect IPs | 104.18.28.20, 104.18.29.20, 2606:4700::6812:1c14, 2606:4700::6812:1d14 |
Response IP | 104.18.28.20 |
Found | Yes |
Hash | 15ab07bd06871921fdba37d01998ebb520b1f70612147633bafc2d0a3cd3539c |
SimHash | 086318c0b113 |
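The Hash value is 64 hexadecimal digits long, which is consistent with a SHA-256 digest, although the report does not name the algorithm or say which bytes were hashed. The sketch below assumes SHA-256 over the raw response body; `content_hash` is a hypothetical helper, not the scanner's own code.

```python
import hashlib

# Hash value copied from the Last Scan table.
REPORTED_HASH = "15ab07bd06871921fdba37d01998ebb520b1f70612147633bafc2d0a3cd3539c"

def content_hash(body: bytes) -> str:
    """Return the SHA-256 hex digest of a response body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()

# A SHA-256 hex digest always has 64 characters, matching the reported length.
print(len(REPORTED_HASH), len(content_hash(b"")))
```

Comparing `content_hash(body)` for a freshly fetched robots.txt against the reported value would show whether the file has changed since the scan, provided the assumption about the algorithm and input holds.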
Groups
*
Rule | Path |
---|---|
Disallow | /wp-admin* |
Disallow | *s%3Dfeed |
Disallow | */?s&%3B* |
Disallow | */?s=* |
Disallow | *s%3D* |
Disallow | /search/* |
Disallow | /search?q=* |
Disallow | /?filter* |
Disallow | *?share=* |
*
Rule | Path |
---|---|
Disallow | (empty) |
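The Disallow paths in the first group use `*` wildcards. A minimal sketch of how such patterns can be tested against a URL path follows, assuming the common convention that `*` matches any run of characters, a trailing `$` anchors the end, and a pattern otherwise matches as a prefix. The function names are illustrative. The sketch skips percent-encoding normalisation and Allow/Disallow precedence, and it treats an empty Disallow value, as in the second group, as matching nothing.

```python
import re

# Disallow paths copied from the first "*" group; the empty string stands
# for the second group's Disallow rule with no path.
DISALLOW = [
    "/wp-admin*", "*s%3Dfeed", "*/?s&%3B*", "*/?s=*", "*s%3D*",
    "/search/*", "/search?q=*", "/?filter*", "*?share=*", "",
]

def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with ".*" where "*" appeared.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_disallowed(path: str, patterns=DISALLOW) -> bool:
    """Return True if any non-empty pattern matches from the start of path."""
    return any(pattern_to_regex(p).match(path) for p in patterns if p)

print(is_disallowed("/wp-admin/options.php"))  # True
print(is_disallowed("/?s=news"))               # True
print(is_disallowed("/about/"))                # False
```

Under these assumptions, admin pages, search results, and share links are blocked for all crawlers, while ordinary content paths remain crawlable.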
Other Records
Field | Value |
---|---|
sitemap | https://www.dcthomson.co.uk/sitemap_index.xml |