hetoog.be
robots.txt

Robots Exclusion Standard data for hetoog.be

Resource Scan

Scan Details

Site Domain hetoog.be
Base Domain hetoog.be
Scan Status Ok
Last Scan2025-10-27T04:18:20+00:00
Next Scan 2025-11-03T04:18:20+00:00

Last Scan

Scanned2025-10-27T04:18:20+00:00
URL https://hetoog.be/robots.txt
Redirect https://www.hetoog.be/robots.txt
Redirect Domain www.hetoog.be
Redirect Base hetoog.be
Domain IPs 185.18.9.220
Redirect IPs 185.18.9.220
Response IP 185.18.9.220
Found Yes
Hash a79754cfe35f4d5c2b4b92850bdb690b4469f52eb66388d292d687fbd8da2378
SimHash 43701d562597

Groups

*

Rule Path
Disallow /cpresources/
Disallow /vendor/
Disallow /.env
Disallow /cache/

Other Records

Field Value
sitemap https://www.hetoog.be/nl/sitemaps-1-sitemap.xml

Comments

  • robots.txt for /nl/
  • live - don't allow web crawlers to index cpresources/ or vendor/