clinician.com
robots.txt

Robots Exclusion Standard data for clinician.com

Resource Scan

Scan Details

Site Domain clinician.com
Base Domain clinician.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-12-12T01:38:10+00:00
Next Scan 2026-03-12T01:38:10+00:00

Last Successful Scan

Scanned2025-08-13T10:47:17+00:00
URL https://clinician.com/robots.txt
Redirect https://www.clinician.com/robots.txt
Redirect Domain www.clinician.com
Redirect Base clinician.com
Domain IPs 104.18.10.41, 104.18.11.41, 2606:4700::6812:a29, 2606:4700::6812:b29
Redirect IPs 104.18.10.41, 104.18.11.41, 2606:4700::6812:a29, 2606:4700::6812:b29
Response IP 104.18.11.41
Found Yes
Hash 8f4f9d5037f73b3a92536c21582f86130e4192b9349252f9b1f381b806972229
SimHash e1601d622f12

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/
Disallow /ahc/img/

gptbot

Rule Path
Disallow /blogs

Other Records

Field Value
sitemap https://www.clinician.com/sitemaps-3-sitemap.xml

Comments

  • robots.txt for https://www.clinician.com/
  • live - don't allow web crawlers to index cpresources/ or vendor/