safe-haven.dk
robots.txt

Robots Exclusion Standard data for safe-haven.dk

Resource Scan

Scan Details

Site Domain safe-haven.dk
Base Domain safe-haven.dk
Scan Status Ok
Last Scan 2025-03-07T15:24:42+00:00
Next Scan 2025-04-06T15:24:42+00:00
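
The two timestamps above are exactly 30 days apart, which suggests (but does not confirm) a fixed 30-day rescan interval. A minimal sketch of that arithmetic, where `RESCAN_INTERVAL` is an assumed name and value inferred only from these two dates:

```python
from datetime import datetime, timedelta

# Assumption: the interval is inferred from the listed timestamps,
# not documented by the scanner itself.
RESCAN_INTERVAL = timedelta(days=30)

last_scan = datetime.fromisoformat("2025-03-07T15:24:42+00:00")
next_scan = last_scan + RESCAN_INTERVAL

print(next_scan.isoformat())
```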

Last Scan

Scanned 2025-03-07T15:24:42+00:00
URL https://safe-haven.dk/robots.txt
Domain IPs 93.191.155.253
Response IP 93.191.155.253
Found Yes
Hash 157e822ee930078ccf98df7a9440906be7b810b3750e9ae88f605ed1e343731e
SimHash dd9401036cf1
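
The Hash value is 64 hexadecimal characters long, which is consistent with a SHA-256 digest of the response body. The scan does not name the algorithm, so treat that as an assumption. Under it, a fingerprint for change detection could be computed roughly like this (`fingerprint` is a hypothetical helper name):

```python
import hashlib

def fingerprint(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt response body.

    Assumption: SHA-256 is a guess based on the 64-character length
    of the Hash field; the scanner may use something else.
    """
    return hashlib.sha256(body).hexdigest()

# Any change to the file, even a single byte, yields a different digest.
before = fingerprint(b"User-agent: *\nAllow: /\n")
after = fingerprint(b"User-agent: *\nDisallow: /\n")
print(before != after)
```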

Groups

*

Rule Path
Allow /
Disallow /misc/
Disallow /t3lib/
Disallow /typo3/
Disallow /typo3conf/
Disallow /no_cache/
Disallow /faste-sideelementer/
Disallow /faste-elementer/
Disallow /404/
Disallow /?id=105
Disallow /?id=992
Disallow /*%26type%3D98
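
Because the group opens with `Allow /`, the outcome for a given path depends on which rule a crawler treats as decisive. The sketch below applies the longest-match convention from RFC 9309 (the most specific matching pattern wins, and Allow wins a tie) to the rules listed above. It is an illustration only: `RULES`, `pattern_to_regex`, and `is_allowed` are hypothetical names, pattern length is used as a simple measure of specificity, and individual crawlers may evaluate the file differently.

```python
import re

# Rules copied from the "*" group above, in the order listed.
RULES = [
    ("allow", "/"),
    ("disallow", "/misc/"),
    ("disallow", "/t3lib/"),
    ("disallow", "/typo3/"),
    ("disallow", "/typo3conf/"),
    ("disallow", "/no_cache/"),
    ("disallow", "/faste-sideelementer/"),
    ("disallow", "/faste-elementer/"),
    ("disallow", "/404/"),
    ("disallow", "/?id=105"),
    ("disallow", "/?id=992"),
    ("disallow", "/*%26type%3D98"),
]

def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Everything else is matched literally as a prefix.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))

def is_allowed(path, rules=RULES):
    """Return True if the longest matching rule is an Allow (or nothing matches)."""
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            # Longer pattern wins; on equal length, Allow wins.
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed

for path in ["/", "/typo3/index.php", "/?id=105", "/?id=106", "/page/%26type%3D98"]:
    print(path, is_allowed(path))
```

Under this convention the broad `Allow /` only decides paths that no longer Disallow pattern matches. A parser that instead stops at the first matching rule in file order would let `Allow /` override every Disallow line, so results can differ between tools.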

Comments

  • Allow bot to enter
  • Exclude only folders with no
  • link from frontend, like
  • templates, css, js.
  • Disable non-realurl
  • Disallow: /*?id=*
  • Disable print pages
  • Your Sitemap
  • Sitemap: http://safe-haven.dk/?eID=dd_googlesitemap
  • Your RSS Feed
  • Sitemap: http://www.example.tld/rss.xml