Robots Exclusion Standard data for leith.co.uk

Resource Scan

Scan Details

Site Domain leith.co.uk
Base Domain leith.co.uk
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-10-08T21:53:40+00:00
Next Scan 2026-01-06T21:53:40+00:00

Last Successful Scan

Scanned 2025-06-04T13:36:43+00:00
URL https://leith.co.uk/robots.txt
Domain IPs 104.26.2.147, 104.26.3.147, 172.67.70.129, 2606:4700:20::681a:293, 2606:4700:20::681a:393, 2606:4700:20::ac43:4681
Response IP 104.26.3.147
Found Yes
Hash 55226ce3cd4fd9fd3dcd2c4f622bf24e88fff1ef5c2ce43a50dcabceace823c1
SimHash a11c99225776

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

Other Records

Field Value
sitemap https://leith.co.uk/sitemaps-1-sitemap.xml

Comments

  • robots.txt for https://leith.co.uk/
  • live - don't allow web crawlers to index cpresources/ or vendor/
  • Disallow ChatGPT bot, as there's no benefit to allowing it to index your site
  • Disallow Google Bard and Vertex AI bots, as there's no benefit to allowing it to index your site
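
The groups and sitemap record listed above can be checked programmatically. The following is a minimal sketch, not the site's actual file: the robots.txt text is reconstructed from the scan data, so its exact whitespace, ordering, and comment lines are assumptions, and `can_crawl`, `BASE_URL`, and the `ExampleBot` agent are hypothetical names used only for illustration. It uses Python's standard `urllib.robotparser` module.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the scan record above (assumption: the real file's
# layout and comment lines may differ; only the rules are taken from the scan).
ROBOTS_TXT = """\
User-agent: *
Disallow: /cpresources/
Disallow: /vendor/
Disallow: /.env
Disallow: /cache/

User-agent: gptbot
Disallow: /

User-agent: google-extended
Disallow: /

Sitemap: https://leith.co.uk/sitemaps-1-sitemap.xml
"""

BASE_URL = "https://leith.co.uk"  # hypothetical constant for building test URLs

# Parse the rules from memory instead of fetching them over the network.
parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def can_crawl(user_agent: str, path: str) -> bool:
    """Return True if the reconstructed rules allow user_agent to fetch path."""
    return parser.can_fetch(user_agent, BASE_URL + path)


if __name__ == "__main__":
    checks = [
        ("ExampleBot", "/"),        # falls under the "*" group
        ("ExampleBot", "/vendor/"),
        ("GPTBot", "/"),            # agent matching is case-insensitive
        ("Google-Extended", "/"),
    ]
    for agent, path in checks:
        verdict = "allowed" if can_crawl(agent, path) else "disallowed"
        print(f"{agent} {path}: {verdict}")
    print("Sitemaps:", parser.site_maps())
```

Because `urllib.robotparser` matches rules by simple path prefix, it is sufficient for the plain `Disallow` paths recorded here; a file that relied on wildcard patterns would need a different parser.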