johnpreston.co.uk
robots.txt

Robots Exclusion Standard data for johnpreston.co.uk

Resource Scan

Scan Details

Site Domain johnpreston.co.uk
Base Domain johnpreston.co.uk
Scan Status Ok
Last Scan2025-03-10T00:15:41+00:00
Next Scan 2025-04-09T00:15:41+00:00

Last Scan

Scanned2025-03-10T00:15:41+00:00
URL https://johnpreston.co.uk/robots.txt
Domain IPs 104.21.39.108, 172.67.144.145, 2606:4700:3033::ac43:9091, 2606:4700:3034::6815:276c
Response IP 104.21.39.108
Found Yes
Hash 0a0eb0fe9676ed116c0cb7f64071d585ca4c915988bd2403297d4c8df05c04cf
SimHash ef2ddcdbe253

Groups

ahrefsbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 5

*

Rule Path
Disallow /index.php/
Disallow /*?
Disallow /checkout/
Disallow /app/
Disallow /lib/
Disallow /*.php$
Disallow /pkginfo/
Disallow /report/
Disallow /var/
Disallow /catalog/
Disallow /customer/
Disallow /sendfriend/
Disallow /review/
Disallow /*SID%3D

googlebot

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

Other Records

Field Value
sitemap https://www.johnpreston.co.uk/sitemap.xml
sitemap https://www.johnpreston.ie/sitemap_ie.xml

Warnings

  • 1 invalid line.