keenwellbeing.com
robots.txt

Robots Exclusion Standard data for keenwellbeing.com

Resource Scan

Scan Details

Site Domain keenwellbeing.com
Base Domain keenwellbeing.com
Scan Status Ok
Last Scan2026-01-21T22:58:46+00:00
Next Scan 2026-02-20T22:58:46+00:00

Last Scan

Scanned2026-01-21T22:58:46+00:00
URL https://keenwellbeing.com/robots.txt
Redirect https://www.keenwellbeing.com/robots.txt
Redirect Domain www.keenwellbeing.com
Redirect Base keenwellbeing.com
Domain IPs 104.21.6.47, 172.67.154.235, 2606:4700:3033::ac43:9aeb, 2606:4700:3035::6815:62f
Redirect IPs 104.21.6.47, 172.67.154.235, 2606:4700:3033::ac43:9aeb, 2606:4700:3035::6815:62f
Response IP 104.21.6.47
Found Yes
Hash 092071023f5a261a2723e28c63df951e47b15cb6e9f2097ed328cbf3c0e7c63c
SimHash e1509b624776

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/

gptbot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

perplexitybot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.keenwellbeing.com/sitemaps-1-sitemap.xml
sitemap https://www.keenwellbeing.com/de/sitemaps-1-sitemap.xml
sitemap https://www.keenwellbeing.com/fr/sitemaps-1-sitemap.xml

Comments

  • robots.txt for https://www.keenwellbeing.com/
  • live - don't allow web crawlers to index cpresources/ or vendor/
  • Disallow ChatGPT bot, as there's no benefit to allowing it to index your site
  • Disallow Google Bard and Vertex AI bots, as there's no benefit to allowing it to index your site
  • Disallow Perplexity bot, as there's no benefit to allowing it to index your site