si.is
robots.txt

Robots Exclusion Standard data for si.is

Resource Scan

Scan Details

Site Domain si.is
Base Domain si.is
Scan Status Ok
Last Scan2025-11-29T10:52:19+00:00
Next Scan 2025-12-29T10:52:19+00:00

Last Scan

Scanned2025-11-29T10:52:19+00:00
URL https://www.si.is/robots.txt
Domain IPs 2a05:d018:6f6:8707:d413:a1d3:b8ec:7508, 2a05:d018:6f6:8708:ff3d:5b0c:576f:9b9b, 52.215.3.122, 54.229.12.216
Response IP 54.229.12.216
Found Yes
Hash f9b2d33fe9b502f14ea4fdaf3fea506cc526cb3b4402ceff4ecc3d7b906693a3
SimHash cd381074e35d

Groups

*

Rule Path
Disallow /*.jsp
Disallow /bitar/
Allow /bitar/*.js$
Disallow /view/
Disallow /leit/?q
Disallow /leit?q
Disallow /search/?q
Disallow /search?q

Other Records

Field Value
crawl-delay 5