hlcaa.org
robots.txt

Robots Exclusion Standard data for hlcaa.org

Resource Scan

Scan Details

Site Domain hlcaa.org
Base Domain hlcaa.org
Scan Status Ok
Last Scan2025-09-22T16:58:03+00:00
Next Scan 2025-10-06T16:58:03+00:00

Last Scan

Scanned2025-09-22T16:58:03+00:00
URL https://hlcaa.org/robots.txt
Domain IPs 52.74.41.140
Response IP 52.74.41.140
Found Yes
Hash 714b2c295e52fae2d81e79b04a1891d2449668b2e0c1ad4a5cfec46ae0c8823e
SimHash bb50de169e93

Groups

*

Rule Path
Disallow /institutes
Disallow /social_login
Disallow /social_signup
Disallow /switch
Disallow /msopntrck
Disallow /msclcktrck
Disallow /invitation
Disallow /noticeboard
Disallow /profile
Disallow /static/page-not-found.html
Disallow /static/bad-request.html
Disallow /profile/*

Other Records

Field Value
crawl-delay 10