icthealth.org
robots.txt

Robots Exclusion Standard data for icthealth.org

Resource Scan

Scan Details

Site Domain icthealth.org
Base Domain icthealth.org
Scan Status Ok
Last Scan2025-04-11T16:43:19+00:00
Next Scan 2025-04-25T16:43:19+00:00

Last Scan

Scanned2025-04-11T16:43:19+00:00
URL https://icthealth.org/robots.txt
Domain IPs 104.21.50.71, 172.67.158.119, 2606:4700:3033::ac43:9e77, 2606:4700:3036::6815:3247
Response IP 172.67.158.119
Found Yes
Hash e9a330d07ae973e4e620fce47c920e3dcc95d623be8f7ebdcb5a23f9caad34c0
SimHash 2061955767f2

Groups

*

Rule Path
Disallow /~/*
Disallow /admin/*
Disallow /csrf
Disallow /edit/*
Disallow /forms/*
Disallow /honeypot

Other Records

Field Value
sitemap https://icthealth.org/sitemap.xml

Comments

  • Robots file for Internationaal
  • For all user agents out there (shout out)
  • Path of static version of previous website which should not be accessible for search engines
  • Exclude admin, edit and forms
  • Sitemaps