sdg.no
robots.txt

Robots Exclusion Standard data for sdg.no

Resource Scan

Scan Details

Site Domain sdg.no
Base Domain sdg.no
Scan Status Ok
Last Scan2025-10-13T05:25:59+00:00
Next Scan 2025-10-20T05:25:59+00:00

Last Scan

Scanned2025-10-13T05:25:59+00:00
URL https://sdg.no/robots.txt
Domain IPs 2a04:3540:1000:310:432:ff:fe79:4f72, 94.237.112.83
Response IP 94.237.112.83
Found Yes
Hash e0e8a9d6cc49f1bda5208b56f47bd0a02f82f7f295905ac952e7878347cf2291
SimHash 412819762792

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/

Other Records

Field Value
sitemap http://sdg.se/sitemaps-1-sitemap.xml

Comments

  • robots.txt for /
  • live - don't allow web crawlers to index cpresources/ or vendor/