south.is
robots.txt

Robots Exclusion Standard data for south.is

Resource Scan

Scan Details

Site Domain south.is
Base Domain south.is
Scan Status Ok
Last Scan2024-10-24T14:54:38+00:00
Next Scan 2024-11-07T14:54:38+00:00

Last Scan

Scanned2024-10-24T14:54:38+00:00
URL https://south.is/robots.txt
Redirect https://www.south.is/robots.txt
Redirect Domain www.south.is
Redirect Base south.is
Domain IPs 2a05:d018:ed6:eb0c:52:49:138:181, 52.49.138.181
Redirect IPs 2600:9000:2022:0:3:69d1:bdc0:93a1, 2600:9000:2022:2400:3:69d1:bdc0:93a1, 2600:9000:2022:5600:3:69d1:bdc0:93a1, 2600:9000:2022:6400:3:69d1:bdc0:93a1, 2600:9000:2022:7c00:3:69d1:bdc0:93a1, 2600:9000:2022:b400:3:69d1:bdc0:93a1, 2600:9000:2022:c800:3:69d1:bdc0:93a1, 2600:9000:2022:e000:3:69d1:bdc0:93a1, 54.230.112.126, 54.230.112.128, 54.230.112.33, 54.230.112.5
Response IP 52.85.49.56
Found Yes
Hash 242d2f546dd09d8b5ec2a555a32ead233521da3d8de288c1b171900ad7548fc0
SimHash a9219531cbeb

Groups

*

Rule Path
Disallow /*/search?
Disallow /*/leit?
Disallow /_w/
Disallow /inc/
Disallow /lang/
Disallow /lib/
Disallow /local/
Disallow /modules/
Disallow /sql/
Disallow /static/header/

Other Records

Field Value
crawl-delay 5