healthiack.com
robots.txt

Robots Exclusion Standard data for healthiack.com

Resource Scan

Scan Details

Site Domain healthiack.com
Base Domain healthiack.com
Scan Status Ok
Last Scan 2024-10-03T09:38:24+00:00
Next Scan 2024-10-10T09:38:24+00:00

Last Scan

Scanned 2024-10-03T09:38:24+00:00
URL https://healthiack.com/robots.txt
Domain IPs 104.21.58.175, 172.67.162.101
Response IP 172.67.162.101
Found Yes
Hash c20a198310389437bc92c3ee2235d7d98deacc1eae4c6ce7e0df233d1bb3b94e
SimHash 09555f72cd72

Groups

ia_archiver

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

ia_archiver-web.archive.org

Rule Path
Disallow /

*

Rule Path
Disallow /post
Disallow /page
Disallow */page
Disallow /member
Disallow /wp-admin
Disallow /category
Disallow /date
Disallow /go
Disallow /downloads
Disallow /author
Disallow /author/*
Disallow /uncategorized
Disallow /201*
Disallow /202*
Allow /
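
How the "*" group above applies to a given URL path can be sketched in Python. This is a minimal illustration, not the scanner's own logic: it assumes the longest-match precedence described in RFC 9309 (the most specific matching rule wins, and Allow wins a tie), and the helper names pattern_to_regex and is_allowed are hypothetical. Crawlers that evaluate rules differently, for example in file order, may reach other results.

```python
import re

# Rules for the "*" group, copied from the listing above.
RULES = [
    ("disallow", "/post"),
    ("disallow", "/page"),
    ("disallow", "*/page"),
    ("disallow", "/member"),
    ("disallow", "/wp-admin"),
    ("disallow", "/category"),
    ("disallow", "/date"),
    ("disallow", "/go"),
    ("disallow", "/downloads"),
    ("disallow", "/author"),
    ("disallow", "/author/*"),
    ("disallow", "/uncategorized"),
    ("disallow", "/201*"),
    ("disallow", "/202*"),
    ("allow", "/"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex (hypothetical helper).

    "*" matches any run of characters; a trailing "$" anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence.

    Assumption: the longest matching pattern wins, and Allow wins ties.
    A path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        # re.match anchors at the start of the path, so patterns act as prefixes.
        if pattern_to_regex(pattern).match(path):
            is_allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and is_allow):
                best_len = len(pattern)
                allowed = is_allow
    return allowed


if __name__ == "__main__":
    for sample in ["/", "/about", "/post/123", "/blog/page/2", "/2024/05/x", "/goals"]:
        print(sample, is_allowed(sample))
```

Because the patterns are prefix matches, a rule such as Disallow /go also blocks unrelated paths that merely start with the same characters, such as /goals. The final Allow / only takes effect for paths that match no longer Disallow pattern.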