pitan.is
robots.txt
Robots Exclusion Standard data for pitan.is
Resource Scan
Scan Details
Site Domain | pitan.is |
Base Domain | pitan.is |
Scan Status | Ok |
Last Scan | 2024-06-16T04:03:07+00:00 |
Next Scan | 2024-06-30T04:03:07+00:00 |
Last Scan
Scanned | 2024-06-16T04:03:07+00:00 |
URL | https://www.pitan.is/robots.txt |
Domain IPs | 108.138.246.14, 108.138.246.22, 108.138.246.7, 108.138.246.80, 2600:9000:24ba:1600:9:bf35:240:93a1, 2600:9000:24ba:4600:9:bf35:240:93a1, 2600:9000:24ba:8c00:9:bf35:240:93a1, 2600:9000:24ba:8e00:9:bf35:240:93a1, 2600:9000:24ba:a800:9:bf35:240:93a1, 2600:9000:24ba:be00:9:bf35:240:93a1, 2600:9000:24ba:d000:9:bf35:240:93a1, 2600:9000:24ba:ee00:9:bf35:240:93a1 |
Response IP | 3.160.246.55 |
Found | Yes |
Hash | 242d2f546dd09d8b5ec2a555a32ead233521da3d8de288c1b171900ad7548fc0 |
SimHash | a9219531cbeb |
Groups
*
Rule | Path |
---|---|
Disallow | /*/search? |
Disallow | /*/leit? |
Disallow | /_w/ |
Disallow | /inc/ |
Disallow | /lang/ |
Disallow | /lib/ |
Disallow | /local/ |
Disallow | /modules/ |
Disallow | /sql/ |
Disallow | /static/header/ |
Other Records
Field | Value |
---|---|
crawl-delay | 5 |