hl.is
robots.txt

Robots Exclusion Standard data for hl.is

Resource Scan

Scan Details

Site Domain hl.is
Base Domain hl.is
Scan Status Ok
Last Scan2025-08-28T15:42:11+00:00
Next Scan 2025-09-04T15:42:11+00:00

Last Scan

Scanned2025-08-28T15:42:11+00:00
URL https://hl.is/robots.txt
Domain IPs 2607:f1c0:100f:f000::2de, 74.208.236.215
Response IP 74.208.236.215
Found Yes
Hash c8ef701e20761c8133c69948aeebabc014655cfd753f8b993002b739eeeb29a6
SimHash 43701d562d17

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/

Other Records

Field Value
sitemap https://hl.is/sitemaps-1-sitemap.xml
sitemap https://hl.is/sitemaps-1-sitemap.xml
sitemap https://hl.is/sitemaps-1-sitemap.xml

Comments

  • robots.txt for //hl.is/
  • live - don't allow web crawlers to index cpresources/ or vendor/