webheads.co.uk
robots.txt
Robots Exclusion Standard data for webheads.co.uk
Resource Scan
Scan Details
| Site Domain | webheads.co.uk |
| Base Domain | webheads.co.uk |
| Scan Status | Ok |
| Last Scan | 2026-01-29T23:49:45+00:00 |
| Next Scan | 2026-02-28T23:49:45+00:00 |
Last Scan
| Scanned | 2026-01-29T23:49:45+00:00 |
| URL | https://webheads.co.uk/robots.txt |
| Domain IPs | 84.18.203.40 |
| Response IP | 84.18.203.40 |
| Found | Yes |
| Hash | 466b58fa3e06facf90ecd02c2c249d4ae4a43632a0b3533597f2cc08dc053062 |
| SimHash | 28229cc0259f |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /_mm/ |
| Disallow | /_notes/ |
| Disallow | /_baks/ |
| Disallow | /cgi/ |
| Disallow | /trash/ |
| Disallow | /sleddog/ |
| Disallow | /santa-placeholder/* |
| Disallow | /earl-placeholder/* |
| Disallow | /eland-placeholder/* |
| Disallow | /web-blog/items/* |
| Disallow | /web-blog/* |
| Disallow | /search/ |
| Disallow | /?* |
| Disallow | /web-portfolio/the-vault/* |