Robots Exclusion Standard data for thewatermill-dorking.co.uk
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | thewatermill-dorking.co.uk |
Base Domain | thewatermill-dorking.co.uk |
Scan Status | Ok |
Last Scan | 2024-10-13T11:41:25+00:00 |
Next Scan | 2024-11-12T11:41:25+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-10-13T11:41:25+00:00 |
URL | https://www.thewatermill-dorking.co.uk/robots.txt |
Domain IPs | 2600:1413:b000:6::17d5:2bca, 2600:1413:b000:6::17d5:2be0, 96.17.96.16, 96.17.96.20 |
Response IP | 23.54.118.45 |
Found | Yes |
Hash | f40eab6dc024aa2ad52323bf9dfac9ed9aea5535c6ba7964e0c1fa6a1c2decb8 |
SimHash | 1b075b654f51 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /App_Data/ |
Disallow | /masterpages/ |
Disallow | /bin/ |
Disallow | /config/ |
Disallow | /css/ |
Disallow | /data/ |
Disallow | /js/ |
Disallow | /images/ |
Disallow | /includes/ |
Disallow | /media/ |
Disallow | /Properties/ |
Disallow | /scripts/ |
Disallow | /sitecore/ |
Disallow | */sitecore/* |
Disallow | /usercontrols/ |
Disallow | /xslt/ |
Disallow | /Web.config |
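
To see how a crawler might apply the rules listed for this group, the following is a minimal sketch using Python's standard `urllib.robotparser`. The rule list is rebuilt from the table above, so the live file's exact ordering and formatting are assumptions. The helper names (`build_parser`, `is_allowed`) and the sample paths such as `/css/site.css` are hypothetical and chosen only for illustration.

```python
from urllib.robotparser import RobotFileParser

# Paths copied from the Disallow table; the original file layout is assumed.
DISALLOWED = [
    "/App_Data/", "/masterpages/", "/bin/", "/config/", "/css/", "/data/",
    "/js/", "/images/", "/includes/", "/media/", "/Properties/", "/scripts/",
    "/sitecore/", "*/sitecore/*", "/usercontrols/", "/xslt/", "/Web.config",
]

BASE = "https://www.thewatermill-dorking.co.uk"


def build_parser(paths):
    """Build a parser from one 'User-agent: *' group of Disallow rules."""
    lines = ["User-agent: *"] + [f"Disallow: {p}" for p in paths]
    parser = RobotFileParser()
    parser.parse(lines)
    return parser


def is_allowed(parser, path, agent="*"):
    """Return True if the given agent may fetch BASE + path."""
    return parser.can_fetch(agent, BASE + path)


parser = build_parser(DISALLOWED)

# Hypothetical sample paths, not taken from the scanned site.
for path in ["/", "/css/site.css", "/sitecore/login", "/Web.config"]:
    print(path, is_allowed(parser, path))
```

Note that `urllib.robotparser` matches rules by plain path prefix and does not expand `*` wildcards, so the `*/sitecore/*` rule is unlikely to behave here the way it would in a crawler that supports wildcard patterns. Matching is also case-sensitive, so `/Web.config` and `/web.config` are treated as different paths.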
Other Records
Field | Value |
---|---|
sitemap | https://www.thewatermill-dorking.co.uk/sitemap.xml |
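
The sitemap record can be read with the same standard-library parser. This is a small sketch, assuming Python 3.8 or later (where `RobotFileParser.site_maps()` is available). The sample lines are a shortened, assumed reconstruction of the file, and `sitemap_urls` is a hypothetical helper name.

```python
from urllib.robotparser import RobotFileParser

# Shortened reconstruction: one Disallow rule plus the reported sitemap record.
ROBOTS_LINES = [
    "User-agent: *",
    "Disallow: /sitecore/",
    "Sitemap: https://www.thewatermill-dorking.co.uk/sitemap.xml",
]


def sitemap_urls(lines):
    """Return the Sitemap URLs declared in robots.txt lines (empty list if none)."""
    parser = RobotFileParser()
    parser.parse(lines)
    # site_maps() returns None when no Sitemap record is present.
    return parser.site_maps() or []


print(sitemap_urls(ROBOTS_LINES))
```

Sitemap records are independent of user-agent groups, so they apply to every crawler regardless of which group it matches.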