ictandhealth.com
robots.txt

Robots Exclusion Standard data for ictandhealth.com

Resource Scan

Scan Details

Site Domain ictandhealth.com
Base Domain ictandhealth.com
Scan Status Ok
Last Scan 2025-05-23T16:39:06+00:00
Next Scan 2025-06-06T16:39:06+00:00

Last Scan

Scanned 2025-05-23T16:39:06+00:00
URL https://ictandhealth.com/robots.txt
Redirect https://icthealth.org/robots.txt
Redirect Domain icthealth.org
Redirect Base icthealth.org
Domain IPs 104.21.28.36, 172.67.144.13, 2606:4700:3030::ac43:900d, 2606:4700:3034::6815:1c24
Redirect IPs 104.21.50.71, 172.67.158.119, 2606:4700:3033::ac43:9e77, 2606:4700:3036::6815:3247
Response IP 104.21.50.71
Found Yes
Hash 9bb66b32c0625856eb387606e4d9150359b28574007ca7262591f9ab3ed5b2fd
SimHash 2061955767f2
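
The Hash value above is 64 hexadecimal digits long, which is consistent with a SHA-256 digest. The report does not say which algorithm or which bytes were hashed, so the sketch below is only an assumption: it hashes the raw response body with SHA-256 to show how a scanner could detect that a robots.txt file changed between two scans. The function names `body_digest` and `has_changed` are hypothetical.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt response body.

    Assumption: the report's Hash field is a SHA-256 digest of the raw body.
    The source does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_digest: str, body: bytes) -> bool:
    """Report whether a freshly fetched body differs from the stored digest."""
    return body_digest(body) != previous_digest
```

A scanner following this approach would store the digest from each scan and compare it with the digest of the next fetch, so an unchanged file can be skipped without re-parsing.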

Groups

*

Rule Path
Disallow /~/*
Disallow /admin/*
Disallow /csrf
Disallow /edit/*
Disallow /forms/*
Disallow /honeypot
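
The rules listed for the `*` group can be checked against a URL path with a small matcher. The sketch below is illustrative rather than any particular crawler's implementation: it assumes the common wildcard semantics in which `*` matches any sequence of characters, a trailing `$` anchors the end of the path, and every pattern is otherwise a prefix match. The names `DISALLOW`, `pattern_to_regex` and `is_allowed` are hypothetical. Because this group contains only Disallow rules, no Allow/Disallow precedence logic is needed.

```python
import re

# Disallow patterns copied from the "*" group in this report.
DISALLOW = ["/~/*", "/admin/*", "/csrf", "/edit/*", "/forms/*", "/honeypot"]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into a compiled regex.

    "*" becomes ".*"; a trailing "$" anchors the end of the path;
    everything else is matched literally as a prefix.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path: str, disallow=DISALLOW) -> bool:
    """Return True if no Disallow pattern matches the start of the path."""
    # re.match anchors at the beginning of the string, giving prefix matching.
    return not any(pattern_to_regex(p).match(path) for p in disallow)
```

Under these assumptions, `is_allowed("/admin/users")` is False, while `is_allowed("/admin")` is True because the rule `/admin/*` requires the trailing slash. Rules without a wildcard, such as `/csrf`, still block any path that starts with that prefix.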

Other Records

Field Value
sitemap https://icthealth.org/sitemap.xml

Comments

  • Robots file for Global
  • For all user agents out there (shout out)
  • Path of static version of previous website which should not be accessible for search engines
  • Exclude admin, edit and forms
  • Sitemaps