visitguttland.lu
robots.txt

Robots Exclusion Standard data for visitguttland.lu

Resource Scan

Scan Details

Site Domain visitguttland.lu
Base Domain visitguttland.lu
Scan Status Ok
Last Scan2025-12-05T20:11:09+00:00
Next Scan 2025-12-19T20:11:09+00:00

Last Scan

Scanned2025-12-05T20:11:09+00:00
URL https://visitguttland.lu/robots.txt
Redirect https://www.visitguttland.lu/robots.txt
Redirect Domain www.visitguttland.lu
Redirect Base visitguttland.lu
Domain IPs 2a01:4f8:c011:469::1, 49.12.23.185
Redirect IPs 2a01:4f8:c011:469::1, 49.12.23.185
Response IP 49.12.23.185
Found Yes
Hash 0e8fa4341431060f387315c6a840e8915f623166763d4fe9220684715c3db645
SimHash 7d59ff41ece9

Groups

gptbot

Rule Path
Disallow

chatgpt-user/2.0

Rule Path
Disallow

claudebot

Rule Path
Disallow

perplexitybot

Rule Path
Disallow

perplexity-user/1.0

Rule Path
Disallow

google-extended

Rule Path
Disallow

amazonbot

Rule Path
Disallow

duckassistbot

Rule Path
Disallow

mistralai-user/1.0

Rule Path
Disallow

youbot

Rule Path
Disallow

facebookexternalhit

Rule Path
Disallow

facebot

Rule Path
Disallow

*

Rule Path
Disallow
Disallow /*?id=*
Disallow /*%26id%3D*
Disallow /*?L=0*
Disallow /*%26L%3D0*
Disallow /*/Private/*
Disallow /*/Configuration/*
Disallow /typo3temp/var/*
Disallow /typo3/
Disallow *.sql
Disallow *.sql.gz
Disallow /*?*eventDateId=
Disallow *.ics
Disallow /*cHash%3D
Disallow /*map?ident=
Disallow /*IMXTOOLS_ADDRESSBASE
Disallow /*%26theme%3D
Disallow /*.gpx

Other Records

Field Value
sitemap https://www.visitguttland.lu/sitemap.xml
sitemap https://www.visitguttland.lu/de/sitemap.xml
sitemap https://www.visitguttland.lu/fr/sitemap.xml

Comments

  • We do not distinguish between the browsers.
  • Only allow URLs generated with frontend routing
  • L=0 is the default language
  • Should always be protected, but you know...
  • Disallow all files in /typo3temp/var/
  • Disallow all files in /typo3/
  • Disallow all kind of sql files
  • Disallow calendar files