ihi.is
robots.txt

Robots Exclusion Standard data for ihi.is

Resource Scan

Scan Details

Site Domain ihi.is
Base Domain ihi.is
Scan Status Ok
Last Scan2024-04-30T05:02:22+00:00
Next Scan 2024-05-14T05:02:22+00:00

Last Scan

Scanned2024-04-30T05:02:22+00:00
URL https://ihi.is/robots.txt
Redirect https://www.ihi.is/robots.txt
Redirect Domain www.ihi.is
Redirect Base ihi.is
Domain IPs 2a05:d018:ed6:eb0c:52:49:138:181, 52.49.138.181
Redirect IPs 2600:9000:2201:1800:6:4f00:cec0:93a1, 2600:9000:2201:2200:6:4f00:cec0:93a1, 2600:9000:2201:2c00:6:4f00:cec0:93a1, 2600:9000:2201:3a00:6:4f00:cec0:93a1, 2600:9000:2201:5c00:6:4f00:cec0:93a1, 2600:9000:2201:8200:6:4f00:cec0:93a1, 2600:9000:2201:8e00:6:4f00:cec0:93a1, 2600:9000:2201:9800:6:4f00:cec0:93a1, 3.163.24.28, 3.163.24.52, 3.163.24.79, 3.163.24.83
Response IP 3.160.246.22
Found Yes
Hash 242d2f546dd09d8b5ec2a555a32ead233521da3d8de288c1b171900ad7548fc0
SimHash a9219531cbeb

Groups

*

Rule Path
Disallow /*/search?
Disallow /*/leit?
Disallow /_w/
Disallow /inc/
Disallow /lang/
Disallow /lib/
Disallow /local/
Disallow /modules/
Disallow /sql/
Disallow /static/header/

Other Records

Field Value
crawl-delay 5