healio.com
robots.txt

Robots Exclusion Standard data for healio.com

Resource Scan

Scan Details

Site Domain healio.com
Base Domain healio.com
Scan Status Ok
Last Scan2024-10-30T15:57:39+00:00
Next Scan 2024-11-06T15:57:39+00:00

Last Scan

Scanned2024-10-30T15:57:39+00:00
URL https://healio.com/robots.txt
Redirect https://www.healio.com/robots.txt
Redirect Domain www.healio.com
Redirect Base healio.com
Domain IPs 107.154.108.198, 107.154.110.198
Redirect IPs 45.64.67.198
Response IP 45.64.67.198
Found Yes
Hash 89a9b859efae8f8c5fd2f98e752f4664e6031f23bde6405aa4c83c05719b5e87
SimHash 2c5932604f32

Groups

*

Rule Path
Disallow /~/user/
Disallow /*.aspx
Disallow /136749668/
Disallow /6985521/
Disallow /_Incapsula_Resource
Disallow /cws/
Disallow /presentation/
Disallow /Presentation/
Disallow /search
Disallow /Search
Disallow /shop/
Disallow /sitecore/
Disallow /sws/
Disallow /trk/
Disallow /webservices/
Allow /sws/feed/news/*

chatgpt

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

perplexity.ai

Rule Path
Disallow /

jasper.ai

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

Comments

  • Disallow: /*/json/
  • Disallow: /~/hws/
  • Disallow: /h5news/
  • Disallow: /find/

Warnings

  • 1 invalid line.