healthmatch.io
robots.txt

Robots Exclusion Standard data for healthmatch.io

Resource Scan

Scan Details

Site Domain healthmatch.io
Base Domain healthmatch.io
Scan Status Ok
Last Scan2024-11-15T02:20:36+00:00
Next Scan 2024-12-15T02:20:36+00:00

Last Scan

Scanned2024-11-15T02:20:36+00:00
URL https://healthmatch.io/robots.txt
Domain IPs 3.230.140.50, 44.196.5.62, 52.21.181.69
Response IP 52.21.181.69
Found Yes
Hash 382d06491734f27cc2ce4abcd1c7e5a389dc5a0ccaa52cf174356c66ad063698
SimHash e6301dd0dd32

Groups

*

Rule Path
Disallow /*/trials/*
Disallow /trials/
Disallow /trials/*/locations
Disallow /login
Disallow /update-details
Disallow /signup
Disallow /signup/*
Disallow /questionnaire/*
Disallow /static-pages/*
Disallow /assets/*
Disallow /healthy

Other Records

Field Value
sitemap https://healthmatch.io/sitemap.xml

Comments

  • https://www.robotstxt.org/robotstxt.html
  • Explicit content removal