johnpreston.co.uk
robots.txt
Robots Exclusion Standard data for johnpreston.co.uk
Resource Scan
Scan Details
Site Domain | johnpreston.co.uk |
Base Domain | johnpreston.co.uk |
Scan Status | Ok |
Last Scan | 2025-03-10T00:15:41+00:00 |
Next Scan | 2025-04-09T00:15:41+00:00 |
Last Scan
Scanned | 2025-03-10T00:15:41+00:00 |
URL | https://johnpreston.co.uk/robots.txt |
Domain IPs | 104.21.39.108, 172.67.144.145, 2606:4700:3033::ac43:9091, 2606:4700:3034::6815:276c |
Response IP | 104.21.39.108 |
Found | Yes |
Hash | 0a0eb0fe9676ed116c0cb7f64071d585ca4c915988bd2403297d4c8df05c04cf |
SimHash | ef2ddcdbe253 |
Groups
*
Rule | Path |
---|---|
Disallow | /index.php/ |
Disallow | /*? |
Disallow | /checkout/ |
Disallow | /app/ |
Disallow | /lib/ |
Disallow | /*.php$ |
Disallow | /pkginfo/ |
Disallow | /report/ |
Disallow | /var/ |
Disallow | /catalog/ |
Disallow | /customer/ |
Disallow | /sendfriend/ |
Disallow | /review/ |
Disallow | /*SID%3D |
Other Records
Field | Value |
---|---|
sitemap | https://www.johnpreston.co.uk/sitemap.xml |
sitemap | https://www.johnpreston.ie/sitemap_ie.xml |
Warnings
- 1 invalid line.