throskahjalp.is
robots.txt

Robots Exclusion Standard data for throskahjalp.is

Resource Scan

Scan Details

Site Domain throskahjalp.is
Base Domain throskahjalp.is
Scan Status Ok
Last Scan 2024-06-17T00:20:16+00:00
Next Scan 2024-07-01T00:20:16+00:00

Last Scan

Scanned 2024-06-17T00:20:16+00:00
URL https://throskahjalp.is/robots.txt
Redirect https://www.throskahjalp.is/robots.txt
Redirect Domain www.throskahjalp.is
Redirect Base throskahjalp.is
Domain IPs 2a05:d018:ed6:eb0c:52:49:138:181, 52.49.138.181
Redirect IPs 2600:9000:265c:1200:4:6415:48c0:93a1, 2600:9000:265c:2c00:4:6415:48c0:93a1, 2600:9000:265c:9800:4:6415:48c0:93a1, 2600:9000:265c:aa00:4:6415:48c0:93a1, 2600:9000:265c:ae00:4:6415:48c0:93a1, 2600:9000:265c:bc00:4:6415:48c0:93a1, 2600:9000:265c:d000:4:6415:48c0:93a1, 2600:9000:265c:e00:4:6415:48c0:93a1, 3.163.125.33, 3.163.125.40, 3.163.125.69, 3.163.125.9
Response IP 18.165.171.38
Found Yes
Hash 242d2f546dd09d8b5ec2a555a32ead233521da3d8de288c1b171900ad7548fc0
SimHash a9219531cbeb

Groups

*

Rule Path
Disallow /*/search?
Disallow /*/leit?
Disallow /_w/
Disallow /inc/
Disallow /lang/
Disallow /lib/
Disallow /local/
Disallow /modules/
Disallow /sql/
Disallow /static/header/
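
The rules above mix plain path prefixes with `*` wildcards. As a rough illustration of how a crawler might evaluate them, here is a minimal sketch that assumes the usual matching convention (a pattern matches from the start of the path, `*` stands for any run of characters, a trailing `$` anchors the end). The function names and the sample paths are hypothetical and are not taken from the scan.

```python
import re

# Disallow patterns for the "*" group, copied from the scan above.
DISALLOW = [
    "/*/search?", "/*/leit?", "/_w/", "/inc/", "/lang/", "/lib/",
    "/local/", "/modules/", "/sql/", "/static/header/",
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex.

    Assumed semantics: '*' matches any sequence of characters and a
    trailing '$' anchors the pattern to the end of the URL.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces, then join them with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(path_and_query: str) -> bool:
    """Return True if any Disallow pattern matches from the path start."""
    # re.Pattern.match only matches at the beginning of the string,
    # which gives the prefix behaviour robots.txt rules rely on.
    return any(pattern_to_regex(p).match(path_and_query) for p in DISALLOW)


if __name__ == "__main__":
    # Hypothetical sample paths, for illustration only.
    for sample in ["/is/search?q=x", "/is/search", "/lib/app.js", "/static/css/a.css"]:
        print(sample, is_disallowed(sample))
```

Since this group contains only Disallow rules, the sketch skips the longest-match tie-breaking that a full implementation would need once Allow rules are present.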

Other Records

Field Value
crawl-delay 5
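
The `crawl-delay` field is a non-standard extension, and not every crawler honors it. A crawler that does honor it would typically read the value as a minimum number of seconds between requests. The sketch below shows one way to enforce that pause. It assumes the value is in seconds, and the `Throttle` class and its parameters are hypothetical names introduced for this illustration.

```python
import time

# Value of the crawl-delay record above, assumed to be in seconds.
CRAWL_DELAY_SECONDS = 5


class Throttle:
    """Enforce a minimum gap between consecutive requests."""

    def __init__(self, delay, clock=time.monotonic, sleep=time.sleep):
        self.delay = delay
        self.clock = clock    # injectable so the class can be tested
        self.sleep = sleep    # without actually waiting
        self.last = None      # time at which the previous request was released

    def wait(self):
        """Pause if the previous request was too recent; return seconds slept."""
        now = self.clock()
        pause = 0.0
        if self.last is not None:
            pause = max(0.0, self.last + self.delay - now)
            if pause > 0:
                self.sleep(pause)
        self.last = now + pause
        return pause


if __name__ == "__main__":
    throttle = Throttle(CRAWL_DELAY_SECONDS)
    for n in range(3):
        slept = throttle.wait()
        # A real crawler would issue its HTTP request here.
        print(f"request {n}: waited {slept:.1f}s")
```

Calling `wait()` before each request keeps the crawler at or below one request every five seconds, whatever the response time of the server.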