throskahjalp.is
robots.txt

Robots Exclusion Standard data for throskahjalp.is

Resource Scan

Scan Details

Site Domain throskahjalp.is
Base Domain throskahjalp.is
Scan Status Ok
Last Scan2024-11-04T04:51:35+00:00
Next Scan 2024-11-18T04:51:35+00:00

Last Scan

Scanned2024-11-04T04:51:35+00:00
URL https://throskahjalp.is/robots.txt
Redirect https://www.throskahjalp.is/robots.txt
Redirect Domain www.throskahjalp.is
Redirect Base throskahjalp.is
Domain IPs 2a05:d018:ed6:eb0c:52:49:138:181, 52.49.138.181
Redirect IPs 2600:9000:21f8:1a00:4:6415:48c0:93a1, 2600:9000:21f8:3600:4:6415:48c0:93a1, 2600:9000:21f8:8200:4:6415:48c0:93a1, 2600:9000:21f8:9800:4:6415:48c0:93a1, 2600:9000:21f8:b000:4:6415:48c0:93a1, 2600:9000:21f8:bc00:4:6415:48c0:93a1, 2600:9000:21f8:ee00:4:6415:48c0:93a1, 2600:9000:21f8:f000:4:6415:48c0:93a1, 3.160.196.10, 3.160.196.28, 3.160.196.3, 3.160.196.96
Response IP 52.85.49.26
Found Yes
Hash 242d2f546dd09d8b5ec2a555a32ead233521da3d8de288c1b171900ad7548fc0
SimHash a9219531cbeb

Groups

*

Rule Path
Disallow /*/search?
Disallow /*/leit?
Disallow /_w/
Disallow /inc/
Disallow /lang/
Disallow /lib/
Disallow /local/
Disallow /modules/
Disallow /sql/
Disallow /static/header/

Other Records

Field Value
crawl-delay 5