healio.com
robots.txt
Robots Exclusion Standard data for healio.com
Resource Scan
Scan Details
Field | Value |
---|---|
Site Domain | healio.com |
Base Domain | healio.com |
Scan Status | Ok |
Last Scan | 2024-05-28T18:05:53+00:00 |
Next Scan | 2024-06-04T18:05:53+00:00 |
Last Scan
Field | Value |
---|---|
Scanned | 2024-05-28T18:05:53+00:00 |
URL | https://healio.com/robots.txt |
Redirect | https://www.healio.com/robots.txt |
Redirect Domain | www.healio.com |
Redirect Base | healio.com |
Domain IPs | 107.154.108.198, 107.154.110.198 |
Redirect IPs | 45.64.67.198 |
Response IP | 45.64.67.198 |
Found | Yes |
Hash | ef754dbb3e97de4949ba36cbfa83948a8cb35b88c304bb0e170639628cba0884 |
SimHash | 2c5932604f32 |
Groups
User-agent: *
Rule | Path |
---|---|
Disallow | /*/json/ |
Disallow | /~/hws/ |
Disallow | /h5news/ |
Disallow | /~/user/ |
Disallow | /*.aspx |
Disallow | /136749668/ |
Disallow | /6985521/ |
Disallow | /_Incapsula_Resource |
Disallow | /cws/ |
Disallow | /presentation/ |
Disallow | /Presentation/ |
Disallow | /search |
Disallow | /Search |
Disallow | /shop/ |
Disallow | /sitecore/ |
Disallow | /sws/ |
Disallow | /trk/ |
Disallow | /webservices/ |
Allow | /sws/feed/news/* |
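The interaction between the broad `Disallow | /sws/ |` rule and the narrower `Allow | /sws/feed/news/* |` rule is easiest to see by evaluating a few paths. The following is a minimal sketch, not the scanner's own logic: it assumes the longest-match precedence described in RFC 9309 (the matching rule with the longest pattern wins, and `Allow` wins a tie), treats `*` as "any sequence of characters" and a trailing `$` as an end anchor. The names `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical helpers introduced for illustration; the rule list is copied from the table above.

```python
import re

# Rules for the "*" group, copied from the table above.
RULES = [
    ("disallow", "/*/json/"),
    ("disallow", "/~/hws/"),
    ("disallow", "/h5news/"),
    ("disallow", "/~/user/"),
    ("disallow", "/*.aspx"),
    ("disallow", "/136749668/"),
    ("disallow", "/6985521/"),
    ("disallow", "/_Incapsula_Resource"),
    ("disallow", "/cws/"),
    ("disallow", "/presentation/"),
    ("disallow", "/Presentation/"),
    ("disallow", "/search"),
    ("disallow", "/Search"),
    ("disallow", "/shop/"),
    ("disallow", "/sitecore/"),
    ("disallow", "/sws/"),
    ("disallow", "/trk/"),
    ("disallow", "/webservices/"),
    ("allow", "/sws/feed/news/*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally and case-sensitively.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under `rules`.

    Assumed precedence (RFC 9309 style): the longest matching pattern
    wins; on equal length, an Allow rule beats a Disallow rule.
    A path that matches no rule is allowed.
    """
    best = None  # (pattern length, is_allow)
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]


if __name__ == "__main__":
    for p in ["/sws/feed/news/cardiology", "/sws/other", "/search?q=x",
              "/login.aspx", "/api/json/items", "/news/cardiology"]:
        print(p, "->", "allowed" if is_allowed(p) else "blocked")
```

Under these assumptions, paths below `/sws/feed/news/` stay crawlable even though the rest of `/sws/` is blocked, because the `Allow` pattern is longer than `/sws/`. Matching is case-sensitive, which is presumably why the file lists both `/search` and `/Search`.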
Warnings
- 1 invalid line.
Comments