4daagse.nl
robots.txt

Robots Exclusion Standard data for 4daagse.nl

Resource Scan

Scan Details

Site Domain 4daagse.nl
Base Domain 4daagse.nl
Scan Status Ok
Last Scan 2025-04-12T04:27:16+00:00
Next Scan 2025-04-26T04:27:16+00:00

Last Scan

Scanned 2025-04-12T04:27:16+00:00
URL https://4daagse.nl/robots.txt
Redirect https://www.4daagse.nl/robots.txt
Redirect Domain www.4daagse.nl
Redirect Base 4daagse.nl
Domain IPs 104.21.94.102, 172.67.222.89, 2606:4700:3034::ac43:de59, 2606:4700:3037::6815:5e66
Redirect IPs 104.21.94.102, 172.67.222.89, 2606:4700:3034::ac43:de59, 2606:4700:3037::6815:5e66
Response IP 172.67.222.89
Found Yes
Hash e63e618a449320832d64597c9b5c3a6e9e9d6659d64f011158ea02cbeea5c278
SimHash 206db147e7f2

Groups

*

Rule Path
Disallow /~/*
Disallow /admin/*
Disallow /csrf
Disallow /edit/*
Disallow /forms/*
Disallow /honeypot
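
To see which URL paths the Disallow rules listed above would block, the following is a minimal sketch, not the scanner's or any crawler's actual implementation. It assumes the common wildcard interpretation (as described in RFC 9309), where `*` matches any run of characters, a trailing `$` anchors the end, and every rule is otherwise a prefix match. The names `rule_to_regex` and `is_disallowed` are hypothetical helpers introduced for illustration. Because this group contains no Allow rules, the sketch skips longest-match precedence entirely.

```python
import re

# Rule paths copied from the "*" group above; all of them are Disallow rules.
DISALLOW = ["/~/*", "/admin/*", "/csrf", "/edit/*", "/forms/*", "/honeypot"]


def rule_to_regex(path):
    """Translate a robots.txt path pattern into a compiled regex.

    Assumption: '*' matches any run of characters and a trailing '$'
    anchors the end of the path; everything else is matched literally.
    """
    anchored = path.endswith("$")
    if anchored:
        path = path[:-1]
    # Escape the literal pieces and join them with '.*' for each wildcard.
    body = ".*".join(re.escape(part) for part in path.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_disallowed(url_path, rules=DISALLOW):
    """Return True if any Disallow rule matches the start of url_path."""
    # re.match anchors at the beginning, which gives prefix semantics.
    return any(rule_to_regex(rule).match(url_path) for rule in rules)


if __name__ == "__main__":
    for candidate in ["/admin/users", "/admin", "/csrf", "/en/route"]:
        print(candidate, is_disallowed(candidate))
```

Under these assumptions, `/admin/*` blocks `/admin/` and everything beneath it but not the bare path `/admin`, while `/csrf` and `/honeypot` act as plain prefixes. Python's standard `urllib.robotparser` compares rule paths as literal prefixes and does not expand `*`, so it would not treat these rules the same way.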

Other Records

Field Value
sitemap https://www.4daagse.nl/sitemap.xml
sitemap https://www.4daagse.nl/en/sitemap.xml

Comments

  • Robots file for 4Daagse
  • For all user agents out there (shout out)
  • Path of static version of previous website which should not be accessible for search engines
  • Exclude admin, edit and forms
  • Sitemaps