hi-us.org
robots.txt

Robots Exclusion Standard data for hi-us.org

Resource Scan

Scan Details

Site Domain hi-us.org
Base Domain hi-us.org
Scan Status Ok
Last Scan2025-10-08T07:55:04+00:00
Next Scan 2025-11-07T07:55:04+00:00

Last Scan

Scanned2025-10-08T07:55:04+00:00
URL https://hi-us.org/robots.txt
Redirect https://www.hi-us.org/robots.txt
Redirect Domain www.hi-us.org
Redirect Base hi-us.org
Domain IPs 94.125.108.2
Redirect IPs 104.18.10.151, 104.18.11.151, 2606:4700::6812:a97, 2606:4700::6812:b97
Response IP 104.18.10.151
Found Yes
Hash 7a36c0ab98b2409cb9bb7c23cbbfa1ebb03cadceaeb21665ad2039b904f13d55
SimHash a160e8cbc510

Groups

*

Rule Path
Disallow *?&country=*
Disallow *?country=*
Disallow *?size=*
Disallow *?maxw=*
Disallow *?recent=*
Disallow *?el_type=*
Disallow */search?kw=*
Disallow /extenso
Allow /extenso/*.css
Allow /extenso/*.js
Disallow /secure
Disallow /module

adsbot-google

Rule Path
Disallow

googlebot-image

Rule Path
Disallow

adidxbot

Rule Path
Disallow

mediapartners-google

Rule Path
Disallow

Other Records

Field Value
sitemap https://www.hi-us.org/sitemap.xml

Comments

  • URL Parameters
  • Extenso
  • SEA et images
  • Sitemap