healthnews.com
robots.txt

Robots Exclusion Standard data for healthnews.com

Resource Scan

Scan Details

Site Domain healthnews.com
Base Domain healthnews.com
Scan Status Ok
Last Scan 2024-09-16T16:56:05+00:00
Next Scan 2024-09-23T16:56:05+00:00

Last Scan

Scanned 2024-09-16T16:56:05+00:00
URL https://healthnews.com/robots.txt
Domain IPs 172.66.40.104, 172.66.43.152, 2606:4700:3108::ac42:2868, 2606:4700:3108::ac42:2b98
Response IP 172.66.40.104
Found Yes
Hash b2726afc62f7e749363f14f38635bb9fc28d6c8a5f2b0a9925177b26f3c3035a
SimHash 4b54d871e153
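
The report does not say which algorithm produced the Hash value or which bytes were hashed. Its length (64 hexadecimal characters) is consistent with SHA-256, so the sketch below assumes, without confirmation, that it is a SHA-256 digest of the raw robots.txt response body. The function name `matches_scan_hash` is hypothetical and used only for illustration.

```python
import hashlib

# Hash value copied from the scan above.
SCAN_HASH = "b2726afc62f7e749363f14f38635bb9fc28d6c8a5f2b0a9925177b26f3c3035a"


def matches_scan_hash(body: bytes, expected: str = SCAN_HASH) -> bool:
    """Return True if the SHA-256 hex digest of `body` equals `expected`.

    ASSUMPTION: the scanner hashes the raw response body with SHA-256.
    The report itself does not state this.
    """
    return hashlib.sha256(body).hexdigest() == expected.lower()
```

If the assumption holds, a freshly fetched copy of the file that no longer matches would suggest that robots.txt has changed since the last scan.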

Groups

*

Rule Path
Disallow /collections/vendors*
Disallow /api/
Disallow /search/*
Disallow /?s=*

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

bytespider

Rule Path
Disallow /
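
The groups above can be read as follows: a crawler named in its own group (gptbot, chatgpt-user, bytespider) is barred from the whole site, while every other crawler falls back to the `*` group. Below is a minimal sketch of that lookup, assuming a simplified matcher in which `*` in a path matches any run of characters and the user-agent is compared as an exact lowercase token. The names `GROUPS`, `rule_to_regex`, and `is_allowed` are hypothetical. Real crawlers may match product tokens and resolve rule precedence differently.

```python
import re

# Disallow rules copied from the scan above, keyed by lowercase user-agent token.
GROUPS = {
    "*": ["/collections/vendors*", "/api/", "/search/*", "/?s=*"],
    "gptbot": ["/"],
    "chatgpt-user": ["/"],
    "bytespider": ["/"],
}


def rule_to_regex(rule: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Without '$', the rule behaves as a prefix match.
    """
    anchored = rule.endswith("$")
    body = rule[:-1] if anchored else rule
    pattern = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(pattern + ("$" if anchored else ""))


def is_allowed(user_agent: str, path: str) -> bool:
    """Return False if any Disallow rule in the applicable group matches `path`.

    ASSUMPTION: exact token lookup, falling back to the '*' group. The groups
    in this scan contain only Disallow rules, so no Allow handling is needed.
    """
    rules = GROUPS.get(user_agent.lower(), GROUPS["*"])
    return not any(rule_to_regex(rule).match(path) for rule in rules)
```

For example, `is_allowed("GPTBot", "/news/article")` is `False` because the gptbot group disallows `/`, whereas `is_allowed("Googlebot", "/news/article")` is `True` because none of the four `*` rules match that path.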

Other Records

Field Value
sitemap https://healthnews.com/sitemap_index.xml
sitemap https://healthnews.com/news-sitemap.xml