heilsutorg.is
robots.txt

Robots Exclusion Standard data for heilsutorg.is

Resource Scan

Scan Details

Site Domain heilsutorg.is
Base Domain heilsutorg.is
Scan Status Ok
Last Scan 2024-10-31T23:12:04+00:00
Next Scan 2024-11-14T23:12:04+00:00

Last Scan

Scanned 2024-10-31T23:12:04+00:00
URL https://heilsutorg.is/robots.txt
Redirect https://www.heilsutorg.is/robots.txt
Redirect Domain www.heilsutorg.is
Redirect Base heilsutorg.is
Domain IPs 2a05:d018:ed6:eb0c:52:49:138:181, 52.49.138.181
Redirect IPs 13.35.210.104, 13.35.210.122, 13.35.210.6, 13.35.210.63, 2600:9000:2078:1000:1e:456:7dc0:93a1, 2600:9000:2078:4e00:1e:456:7dc0:93a1, 2600:9000:2078:5c00:1e:456:7dc0:93a1, 2600:9000:2078:600:1e:456:7dc0:93a1, 2600:9000:2078:6a00:1e:456:7dc0:93a1, 2600:9000:2078:9e00:1e:456:7dc0:93a1, 2600:9000:2078:a000:1e:456:7dc0:93a1, 2600:9000:2078:b600:1e:456:7dc0:93a1
Response IP 13.35.210.122
Found Yes
Hash 242d2f546dd09d8b5ec2a555a32ead233521da3d8de288c1b171900ad7548fc0
SimHash a9219531cbeb

Groups

*

Rule Path
Disallow /*/search?
Disallow /*/leit?
Disallow /_w/
Disallow /inc/
Disallow /lang/
Disallow /lib/
Disallow /local/
Disallow /modules/
Disallow /sql/
Disallow /static/header/
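
The Disallow rules above can be checked against a URL path with a small matcher. What follows is a minimal sketch, not the scanner's own logic: it assumes the common wildcard reading in which `*` matches any run of characters and a trailing `$` anchors the end of the path, and the helper names `pattern_to_regex` and `is_disallowed` are hypothetical. Since this group contains no Allow rules, longest-match precedence between Allow and Disallow is not modelled.

```python
import re

# Disallow patterns copied from the "*" group listed above.
DISALLOW = [
    "/*/search?",
    "/*/leit?",
    "/_w/",
    "/inc/",
    "/lang/",
    "/lib/",
    "/local/",
    "/modules/",
    "/sql/",
    "/static/header/",
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any run of characters and a trailing '$'
    anchors the match at the end of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces (so '?' is a literal question mark) and
    # join them with '.*' wherever the pattern had a '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path):
    """Return True if the path (including query string) hits any rule."""
    # re.match anchors at the start of the path, as robots.txt rules do.
    return any(pattern_to_regex(p).match(path) for p in DISALLOW)


if __name__ == "__main__":
    for candidate in ["/is/search?q=x", "/static/header/logo.png",
                      "/static/css/site.css", "/search?q=x"]:
        print(candidate, is_disallowed(candidate))
```

Note that `/*/search?` requires at least one path segment before `/search?`, so under this reading a top-level `/search?q=x` is not covered by the rule. Python's standard `urllib.robotparser` does plain prefix matching and does not expand `*`, which is why the sketch builds its own regexes.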

Other Records

Field Value
crawl-delay 5
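
The crawl-delay record is a non-standard extension that is not defined by RFC 9309; crawlers that honor it typically read the value as a minimum number of seconds between successive requests. A minimal sketch of that interpretation follows, assuming the value 5 means five seconds; the function name `schedule` is hypothetical and it only computes timestamps rather than sleeping or fetching.

```python
from typing import List

# Value taken from the "Other Records" table above; interpreting it as
# seconds between requests is an assumption about crawler behaviour.
CRAWL_DELAY_SECONDS = 5.0


def schedule(start: float, n_requests: int,
             delay: float = CRAWL_DELAY_SECONDS) -> List[float]:
    """Return the earliest allowed start time for each of n_requests.

    Requests are spaced at least `delay` seconds apart, beginning at
    `start` (any monotonic clock value, in seconds).
    """
    return [start + i * delay for i in range(n_requests)]


if __name__ == "__main__":
    print(schedule(0.0, 3))
```

A real crawler would sleep until each computed time before issuing the request; that part is left out so the sketch stays free of network and timing side effects.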