shephard.co.uk
robots.txt
Robots Exclusion Standard data for shephard.co.uk
Resource Scan
Scan Details
| Field | Value |
|---|---|
| Site Domain | shephard.co.uk |
| Base Domain | shephard.co.uk |
| Scan Status | Ok |
| Last Scan | 2025-11-03T17:02:46+00:00 |
| Next Scan | 2025-12-03T17:02:46+00:00 |
Last Scan
| Field | Value |
|---|---|
| Scanned | 2025-11-03T17:02:46+00:00 |
| URL | http://shephard.co.uk/robots.txt |
| Redirect | https://shephardmedia.com/robots.txt |
| Redirect Domain | shephardmedia.com |
| Redirect Base | shephardmedia.com |
| Domain IPs | 16.15.186.202, 16.15.194.58, 16.182.96.253, 52.216.43.229, 52.216.93.170, 52.217.173.181, 52.217.73.131, 52.217.90.27 |
| Redirect IPs | 104.21.44.142, 172.67.200.196, 2606:4700:3033::6815:2c8e, 2606:4700:3035::ac43:c8c4 |
| Response IP | 172.67.200.196 |
| Found | Yes |
| Hash | b36b07396e390b2566e9057fc9d8dbb41b9e1537ee2d96eaa2e6c2be9597f730 |
| SimHash | 62340d494fa4 |
Groups
User-agent: *
| Rule | Path |
|---|---|
| Disallow | /news/login |
| Disallow | /news/feed/ |
Other Records
| Field | Value |
|---|---|
| crawl-delay | 1 |
Other Records
| Field | Value |
|---|---|
| sitemap | https://www.shephardmedia.com/sitemap.xml |
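
As a rough illustration of how a crawler might interpret the records listed above, the following sketch rebuilds an approximate robots.txt from the scan tables and queries it with Python's standard `urllib.robotparser`. The exact layout of the live file is an assumption (in particular, placing `Crawl-delay` inside the `*` group), and `ExampleBot` and the sample paths are hypothetical names used only for the demonstration.

```python
from urllib.robotparser import RobotFileParser

# Approximate reconstruction from the scan tables. The real file's ordering,
# spacing, and the placement of Crawl-delay are assumptions.
ROBOTS_TXT = """\
User-agent: *
Disallow: /news/login
Disallow: /news/feed/
Crawl-delay: 1
Sitemap: https://www.shephardmedia.com/sitemap.xml
"""

# Host the scan was redirected to.
BASE = "https://shephardmedia.com"


def load_rules(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


rules = load_rules(ROBOTS_TXT)

# "ExampleBot" is a hypothetical user agent; it falls under the "*" group.
for path in ("/news/login", "/news/feed/rss", "/news/"):
    allowed = rules.can_fetch("ExampleBot", BASE + path)
    print(f"{path}: {'allowed' if allowed else 'disallowed'}")

print("crawl delay:", rules.crawl_delay("ExampleBot"))
print("sitemaps:", rules.site_maps())
```

Note that `urllib.robotparser` matches `Disallow` paths as plain prefixes, so `/news/feed/` blocks everything beneath that directory but not `/news/feed` without the trailing slash. Other crawlers may apply different matching rules.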