hi.org
robots.txt

Robots Exclusion Standard data for hi.org

Resource Scan

Scan Details

Site Domain hi.org
Base Domain hi.org
Scan Status Ok
Last Scan2025-11-24T22:17:31+00:00
Next Scan 2025-12-24T22:17:31+00:00

Last Scan

Scanned2025-11-24T22:17:31+00:00
URL https://hi.org/robots.txt
Redirect https://www.hi.org/robots.txt
Redirect Domain www.hi.org
Redirect Base hi.org
Domain IPs 94.125.108.2
Redirect IPs 104.18.18.2, 104.18.19.2
Response IP 104.18.18.2
Found Yes
Hash 01ce825cb6afffed9b0f3b9b398d871847acdd09d118c2279dd0cda2311c11b2
SimHash 2dc0ca46c710

Groups

*

Rule Path
Disallow /extenso
Allow /extenso/*.css
Allow /extenso/*.js
Disallow /secure
Disallow /module
Disallow *?country=*
Disallow *?size=*
Disallow *?maxw=*
Disallow *?recent=*
Disallow *?el_type=*

adsbot-google

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

adidxbot

Rule Path
Disallow

mediapartners-google

Rule Path
Disallow

Other Records

Field Value
sitemap https://hi.org/sitemap.xml

Comments

  • Url-parametres