healio.com
robots.txt

Robots Exclusion Standard data for healio.com

Resource Scan

Scan Details

Site Domain healio.com
Base Domain healio.com
Scan Status Ok
Last Scan2024-05-28T18:05:53+00:00
Next Scan 2024-06-04T18:05:53+00:00

Last Scan

Scanned2024-05-28T18:05:53+00:00
URL https://healio.com/robots.txt
Redirect https://www.healio.com/robots.txt
Redirect Domain www.healio.com
Redirect Base healio.com
Domain IPs 107.154.108.198, 107.154.110.198
Redirect IPs 45.64.67.198
Response IP 45.64.67.198
Found Yes
Hash ef754dbb3e97de4949ba36cbfa83948a8cb35b88c304bb0e170639628cba0884
SimHash 2c5932604f32

Groups

*

Rule Path
Disallow /*/json/
Disallow /~/hws/
Disallow /h5news/
Disallow /~/user/
Disallow /*.aspx
Disallow /136749668/
Disallow /6985521/
Disallow /_Incapsula_Resource
Disallow /cws/
Disallow /presentation/
Disallow /Presentation/
Disallow /search
Disallow /Search
Disallow /shop/
Disallow /sitecore/
Disallow /sws/
Disallow /trk/
Disallow /webservices/
Allow /sws/feed/news/*

chatgpt

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

perplexity.ai

Rule Path
Disallow /

jasper.ai

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

Comments

  • Disallow: /find/

Warnings

  • 1 invalid line.