hi.org
robots.txt
Robots Exclusion Standard data for hi.org
Resource Scan
Scan Details
| Site Domain | hi.org |
| Base Domain | hi.org |
| Scan Status | Ok |
| Last Scan | 2025-11-24T22:17:31+00:00 |
| Next Scan | 2025-12-24T22:17:31+00:00 |
Last Scan
| Scanned | 2025-11-24T22:17:31+00:00 |
| URL | https://hi.org/robots.txt |
| Redirect | https://www.hi.org/robots.txt |
| Redirect Domain | www.hi.org |
| Redirect Base | hi.org |
| Domain IPs | 94.125.108.2 |
| Redirect IPs | 104.18.18.2, 104.18.19.2 |
| Response IP | 104.18.18.2 |
| Found | Yes |
| Hash | 01ce825cb6afffed9b0f3b9b398d871847acdd09d118c2279dd0cda2311c11b2 |
| SimHash | 2dc0ca46c710 |
Groups
*
| Rule | Path |
|---|---|
| Disallow | /extenso |
| Allow | /extenso/*.css |
| Allow | /extenso/*.js |
| Disallow | /secure |
| Disallow | /module |
| Disallow | *?country=* |
| Disallow | *?size=* |
| Disallow | *?maxw=* |
| Disallow | *?recent=* |
| Disallow | *?el_type=* |
Other Records
| Field | Value |
|---|---|
| sitemap | https://hi.org/sitemap.xml |
Comments