hrcaonline.org
robots.txt
Robots Exclusion Standard data for hrcaonline.org
Resource Scan
Scan Details
Site Domain | hrcaonline.org |
Base Domain | hrcaonline.org |
Scan Status | Ok |
Last Scan | 2025-08-09T08:54:14+00:00 |
Next Scan | 2025-09-08T08:54:14+00:00 |
Last Scan
Scanned | 2025-08-09T08:54:14+00:00 |
URL | https://hrcaonline.org/robots.txt |
Domain IPs | 104.26.4.17, 104.26.5.17, 172.67.73.94, 2606:4700:20::681a:411, 2606:4700:20::681a:511, 2606:4700:20::ac43:495e |
Response IP | 172.67.73.94 |
Found | Yes |
Hash | 2826059db5135d16b87e8c8d77022d6780d17075d65649b0dc52bf6d0aa1846f |
SimHash | 390d1943a5c6 |
Groups
*
Rule | Path | Comment |
---|---|---|
Disallow | /*/ctl/ | Googlebot permits * |
Disallow | /admin/ | - |
Disallow | /App_Browsers/ | - |
Disallow | /App_Code/ | - |
Disallow | /App_Data/ | - |
Disallow | /App_GlobalResources/ | - |
Disallow | /bin/ | - |
Disallow | /Components/ | - |
Disallow | /Config/ | - |
Disallow | /contest/ | - |
Disallow | /controls/ | - |
Disallow | /Documentation/ | - |
Disallow | /HttpModules/ | - |
Disallow | /Install/ | - |
Disallow | /Providers/ | - |
Disallow | /Activity-Feed/userId/ | Do not index user profiles |
Comments