josephgallagher.co.uk
robots.txt

Robots Exclusion Standard data for josephgallagher.co.uk

Resource Scan

Scan Details

Site Domain josephgallagher.co.uk
Base Domain josephgallagher.co.uk
Scan Status Ok
Last Scan2026-03-27T01:36:44+00:00
Next Scan 2026-04-10T01:36:44+00:00

Last Scan

Scanned2026-03-27T01:36:44+00:00
URL https://josephgallagher.co.uk/robots.txt
Redirect https://www.josephgallagher.co.uk/robots.txt
Redirect Domain www.josephgallagher.co.uk
Redirect Base josephgallagher.co.uk
Domain IPs 76.76.21.21
Redirect IPs 34.247.197.143, 52.211.242.121, 52.30.130.86
Response IP 52.211.242.121
Found Yes
Hash 4772657caf8b254b8f7340fb16124503c65b2d53840e627f1b7007b3e1786796
SimHash c3209d762593

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/
Disallow /index.php/actions/
Disallow /esi/

Other Records

Field Value
sitemap https://www.josephgallagher.co.uk/sitemaps-1-sitemap.xml

Comments

  • robots.txt for https://www.josephgallagher.co.uk/
  • live - don't allow web crawlers to index cpresources/ or vendor/