legacywholistichealth.com
robots.txt

Robots Exclusion Standard data for legacywholistichealth.com

Resource Scan

Scan Details

Site Domain legacywholistichealth.com
Base Domain legacywholistichealth.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-08-07T03:15:55+00:00
Next Scan 2024-11-05T03:15:55+00:00

Last Successful Scan

Scanned2023-07-15T03:09:51+00:00
URL https://legacywholistichealth.com/robots.txt
Redirect https://www.legacywholistichealth.com/robots.txt
Redirect Domain www.legacywholistichealth.com
Redirect Base legacywholistichealth.com
Domain IPs 104.21.56.48, 172.67.177.144, 2606:4700:3031::ac43:b190, 2606:4700:3036::6815:3830
Redirect IPs 104.21.56.48, 172.67.177.144, 2606:4700:3031::ac43:b190, 2606:4700:3036::6815:3830
Response IP 172.67.177.144
Found Yes
Hash 85b2a8e565903f4450ca0de850be19e15c1d50c5939583f4d719f92a876518fa
SimHash 8800d8466f12

Groups

*

Rule Path
Disallow

googlebot

Rule Path
Disallow

mediapartners-google

Rule Path
Disallow

netseer

Rule Path
Disallow /

utorrent

Rule Path
Disallow /

baidu

Rule Path
Disallow /

optimizationcrawler

Rule Path
Disallow /

urllib

Rule Path
Disallow /

Other Records

Field Value
sitemap https://hun.legacywholistichealth.com/sitemap.xml

Warnings

  • `host` is not a known field.