south.is
robots.txt
Robots Exclusion Standard data for south.is
Resource Scan
Scan Details
Site Domain | south.is |
Base Domain | south.is |
Scan Status | Ok |
Last Scan | 2024-10-24T14:54:38+00:00 |
Next Scan | 2024-11-07T14:54:38+00:00 |
Last Scan
Scanned | 2024-10-24T14:54:38+00:00 |
URL | https://south.is/robots.txt |
Redirect | https://www.south.is/robots.txt |
Redirect Domain | www.south.is |
Redirect Base | south.is |
Domain IPs | 2a05:d018:ed6:eb0c:52:49:138:181, 52.49.138.181 |
Redirect IPs | 2600:9000:2022:0:3:69d1:bdc0:93a1, 2600:9000:2022:2400:3:69d1:bdc0:93a1, 2600:9000:2022:5600:3:69d1:bdc0:93a1, 2600:9000:2022:6400:3:69d1:bdc0:93a1, 2600:9000:2022:7c00:3:69d1:bdc0:93a1, 2600:9000:2022:b400:3:69d1:bdc0:93a1, 2600:9000:2022:c800:3:69d1:bdc0:93a1, 2600:9000:2022:e000:3:69d1:bdc0:93a1, 54.230.112.126, 54.230.112.128, 54.230.112.33, 54.230.112.5 |
Response IP | 52.85.49.56 |
Found | Yes |
Hash | 242d2f546dd09d8b5ec2a555a32ead233521da3d8de288c1b171900ad7548fc0 |
SimHash | a9219531cbeb |
Groups
*
Rule | Path |
---|---|
Disallow | /*/search? |
Disallow | /*/leit? |
Disallow | /_w/ |
Disallow | /inc/ |
Disallow | /lang/ |
Disallow | /lib/ |
Disallow | /local/ |
Disallow | /modules/ |
Disallow | /sql/ |
Disallow | /static/header/ |
Other Records
Field | Value |
---|---|
crawl-delay | 5 |