healthcentral.com
robots.txt

Robots Exclusion Standard data for healthcentral.com

Resource Scan

Scan Details

Site Domain healthcentral.com
Base Domain healthcentral.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-10-28T02:54:50+00:00
Next Scan 2024-12-27T02:54:50+00:00

Last Successful Scan

Scanned 2024-03-03T02:02:19+00:00
URL https://healthcentral.com/robots.txt
Domain IPs 54.192.18.127, 54.192.18.48, 54.192.18.76, 54.192.18.83
Response IP 13.224.249.57
Found Yes
Hash be9b4aff53b8b839bcdc5a2adc6145bdb0e3bb0645899b10aa6733611f1208b7
SimHash 3a56c9404393

Groups

*

Rule Path
Allow /index.xml
Disallow /healthcare08/app/
Disallow /*_pf.html
Disallow /PrinterFriendly/
Disallow /PrinterFriendly_hc/
Disallow /ads/
Disallow /common/
Allow /common/images/
Allow /common/h/sitemaps/
Disallow /*/pf/
Disallow /noindex/
Disallow /printerfriendly/
Disallow /syndication/
Disallow /utils/
Allow /utils/news/
Disallow /*/emailafriend.php
Disallow /*/includes/
Disallow /*/meet-community.html
Disallow /*/news-includes/
Disallow /*/news-archive-includes/
Disallow /*/privacy-policy.html
Disallow /*/service-terms.html
Disallow /*/whats-new.html
Disallow /ProdStage
Disallow /profiles/
Disallow /bipolar/c/9994
Disallow /bipolar/c/9994/
Disallow /*/feed/

gptbot

Rule Path
Disallow /
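
Several of the rules in the groups above overlap (for example, Disallow /common/ alongside Allow /common/images/), so the outcome for a given path depends on how a crawler resolves conflicts. The following is a minimal sketch, assuming the longest-match precedence described in RFC 9309: the matching rule with the longest path pattern wins, and Allow wins a tie. The report does not state how the scanner itself evaluates rules. The names STAR_RULES, GPTBOT_RULES, rules_for, and is_allowed are hypothetical, and only a subset of the "*" group is copied in.

```python
import re

# Subset of the "*" group listed above (not the full rule set).
STAR_RULES = [
    ("allow", "/index.xml"),
    ("disallow", "/*_pf.html"),
    ("disallow", "/common/"),
    ("allow", "/common/images/"),
    ("disallow", "/utils/"),
    ("allow", "/utils/news/"),
    ("disallow", "/*/feed/"),
]

# The "gptbot" group listed above.
GPTBOT_RULES = [("disallow", "/")]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def rules_for(user_agent):
    """Pick the group for a crawler: a named group if present, else '*'."""
    return GPTBOT_RULES if user_agent.lower() == "gptbot" else STAR_RULES


def is_allowed(user_agent, path):
    """Assumed RFC 9309 precedence: longest matching pattern wins, Allow wins ties."""
    best = None  # (pattern length, rule is an Allow)
    for kind, pattern in rules_for(user_agent):
        # re.match anchors at the start of the path, as robots.txt rules do.
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # A path that matches no rule is allowed by default.
    return True if best is None else best[1]
```

Under these assumptions, is_allowed("examplebot", "/common/images/logo.png") returns True because the longer Allow pattern outranks Disallow /common/, while is_allowed("gptbot", "/index.xml") returns False because the gptbot group replaces the "*" group rather than adding to it.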

Other Records

Field Value
sitemap https://www.healthcentral.com/index.xml

Comments

  • DBSA
  • outgoing feeds, eg WorldNow

Warnings

  • 1 invalid line.