nutraholic.com
robots.txt

Robots Exclusion Standard data for nutraholic.com

Resource Scan

Scan Details

Site Domain nutraholic.com
Base Domain nutraholic.com
Scan Status Ok
Last Scan2025-06-04T15:33:46+00:00
Next Scan 2025-07-04T15:33:46+00:00

Last Scan

Scanned2025-06-04T15:33:46+00:00
URL https://nutraholic.com/robots.txt
Redirect https://www.nutraholic.com/robots.txt
Redirect Domain www.nutraholic.com
Redirect Base nutraholic.com
Domain IPs 104.21.56.30, 172.67.176.27, 2606:4700:3032::ac43:b01b, 2606:4700:3035::6815:381e
Redirect IPs 104.21.56.30, 172.67.176.27, 2606:4700:3032::ac43:b01b, 2606:4700:3035::6815:381e
Response IP 104.21.56.30
Found Yes
Hash eb574dbd38783aba3b920a19920e0bb0f879d32fc2a3988af6ae41e12fb112dd
SimHash 9a75c1124b45

Groups

*

Rule Path
Disallow

*

Rule Path
Disallow /admin

*

Rule Path
Disallow /cfg

*

Rule Path
Disallow /uf/sales-report

*

Rule Path
Disallow /uf/csv

*

Rule Path
Disallow /lib

*

Rule Path
Disallow /inc

archive.org_bot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

ia_archiver-web.archive.org

Rule Path
Disallow /

Comments

  • All robots will spider the domain
  • Disallow directory admin
  • Disallow directory includes
  • Disallow directory includes
  • Disallow directory includes
  • Disallow directory includes
  • Disallow directory includes