webheads.co.uk
robots.txt

Robots Exclusion Standard data for webheads.co.uk

Resource Scan

Scan Details

Site Domain webheads.co.uk
Base Domain webheads.co.uk
Scan Status Ok
Last Scan2026-01-29T23:49:45+00:00
Next Scan 2026-02-28T23:49:45+00:00

Last Scan

Scanned2026-01-29T23:49:45+00:00
URL https://webheads.co.uk/robots.txt
Domain IPs 84.18.203.40
Response IP 84.18.203.40
Found Yes
Hash 466b58fa3e06facf90ecd02c2c249d4ae4a43632a0b3533597f2cc08dc053062
SimHash 28229cc0259f

Groups

*

Rule Path
Disallow /_mm/
Disallow /_notes/
Disallow /_baks/
Disallow /cgi/
Disallow /trash/
Disallow /sleddog/
Disallow /santa-placeholder/*
Disallow /earl-placeholder/*
Disallow /eland-placeholder/*
Disallow /web-blog/items/*
Disallow /web-blog/*
Disallow /search/
Disallow /?*
Disallow /web-portfolio/the-vault/*