healthaffairs.org
robots.txt

Robots Exclusion Standard data for healthaffairs.org

Resource Scan

Scan Details

Site Domain healthaffairs.org
Base Domain healthaffairs.org
Scan Status Ok
Last Scan 2024-10-29T05:51:17+00:00
Next Scan 2024-11-28T05:51:17+00:00

Last Scan

Scanned 2024-10-29T05:51:17+00:00
URL https://healthaffairs.org/robots.txt
Redirect https://www.healthaffairs.org/robots.txt
Redirect Domain www.healthaffairs.org
Redirect Base healthaffairs.org
Domain IPs 104.18.38.235, 172.64.149.21
Redirect IPs 104.18.38.235, 172.64.149.21
Response IP 104.18.38.235
Found Yes
Hash c7f2f99f189ba3da5ecce3ccd765e26c894c367e9cd3ee6d36ad2dc3d6ec8deb
SimHash 693cdce0cbd2

Groups

*

Rule Path
Disallow /action
Disallow /personalize/
Disallow /search
Disallow /feedback
Disallow /rss
Disallow /page/account-confirmation-thanks
Disallow /medical-research
Disallow /servlet/linkout
Disallow /na101/
Disallow /na101v1/
Disallow /na102/
Disallow /doi/mlt/
Disallow /cdn-cg/
Disallow /*startPage
Allow /action/showJournal
Allow /action/showPublications
Allow /action/showXml
Allow /action/showTopic
Allow /action/showBook
Allow /action/showCoverImage
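
The "*" group mixes broad Disallow rules with narrower Allow rules under /action, so the outcome for a given URL depends on which rule takes precedence. The sketch below assumes the longest-match precedence described in RFC 9309 (the most specific matching path wins, and Allow wins a tie); the scan above does not say which convention any particular crawler applies. The helper names `pattern_to_regex` and `is_allowed`, and the sample paths in the usage note, are hypothetical.

```python
import re

# Rules copied from the "*" group above, as (directive, path) pairs.
RULES = [
    ("disallow", "/action"),
    ("disallow", "/personalize/"),
    ("disallow", "/search"),
    ("disallow", "/feedback"),
    ("disallow", "/rss"),
    ("disallow", "/page/account-confirmation-thanks"),
    ("disallow", "/medical-research"),
    ("disallow", "/servlet/linkout"),
    ("disallow", "/na101/"),
    ("disallow", "/na101v1/"),
    ("disallow", "/na102/"),
    ("disallow", "/doi/mlt/"),
    ("disallow", "/cdn-cg/"),
    ("disallow", "/*startPage"),
    ("allow", "/action/showJournal"),
    ("allow", "/action/showPublications"),
    ("allow", "/action/showXml"),
    ("allow", "/action/showTopic"),
    ("allow", "/action/showBook"),
    ("allow", "/action/showCoverImage"),
]


def pattern_to_regex(path):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Without '$', the pattern is treated as a prefix.
    """
    anchored = path.endswith("$")
    if anchored:
        path = path[:-1]
    body = ".*".join(re.escape(part) for part in path.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(url_path, rules=RULES):
    """Return True if url_path may be fetched under longest-match precedence."""
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for directive, path in rules:
        # re.match anchors at the start of url_path, giving prefix semantics.
        if pattern_to_regex(path).match(url_path):
            is_allow = directive == "allow"
            if len(path) > best_len or (len(path) == best_len and is_allow):
                best_len = len(path)
                allowed = is_allow
    return allowed
```

Under these assumptions, `is_allowed("/action/showJournal")` is true because the Allow path is longer than `/action`, while `is_allowed("/action/doSearch")` and `is_allowed("/list?startPage=2")` are both false. A parser that applies rules in file order instead would block `/action/showJournal`, since `Disallow /action` is listed first.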

facebookexternalhit
linkedinbot
twitterbot

Rule Path
Allow /

gptbot

Rule Path
Disallow /
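
The three groups above apply to different crawlers, so a client first has to pick the group that matches its own user-agent token. The following is a minimal sketch of that selection, assuming a case-insensitive exact match on the token with a fallback to the "*" group; real crawlers may match tokens more loosely or merge several matching groups. The `GROUPS` structure and the `select_group` helper are hypothetical, and the "*" group's rule list is abbreviated to two entries to keep the example short.

```python
# Groups from the scan, each with the user-agent tokens that share it.
GROUPS = [
    {
        "agents": ["*"],
        # Abbreviated: only two of the "*" group's rules are shown here.
        "rules": [("disallow", "/action"), ("allow", "/action/showJournal")],
    },
    {
        "agents": ["facebookexternalhit", "linkedinbot", "twitterbot"],
        "rules": [("allow", "/")],
    },
    {
        "agents": ["gptbot"],
        "rules": [("disallow", "/")],
    },
]


def select_group(product_token, groups=GROUPS):
    """Return the group naming this token, else the '*' group, else None."""
    token = product_token.lower()
    fallback = None
    for group in groups:
        if token in group["agents"]:
            return group  # a specific match takes priority over '*'
        if "*" in group["agents"]:
            fallback = group
    return fallback
```

With this selection, `select_group("GPTBot")` returns the group whose only rule is `Disallow /`, `select_group("Twitterbot")` returns the shared `Allow /` group, and any token not listed (for example a hypothetical `ExampleBot`) falls back to the "*" group.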

Other Records

Field Value
crawl-delay 1
sitemap https://www.healthaffairs.org/sitemap-index-1.txt