wir-sagen-ja.com
robots.txt

Robots Exclusion Standard data for wir-sagen-ja.com

Resource Scan

Scan Details

Site Domain wir-sagen-ja.com
Base Domain wir-sagen-ja.com
Scan Status Ok
Last Scan 2025-11-16T22:30:44+00:00
Next Scan 2025-11-23T22:30:44+00:00

Last Scan

Scanned 2025-11-16T22:30:44+00:00
URL https://wir-sagen-ja.com/robots.txt
Redirect https://www.wir-sagen-ja.com/robots.txt
Redirect Domain www.wir-sagen-ja.com
Redirect Base wir-sagen-ja.com
Domain IPs 156.67.238.94
Redirect IPs 156.67.238.94
Response IP 156.67.238.94
Found Yes
Hash f5560b96eed33a42c3dcd520e7fd5d78df197c027b31753116be9c1315c9332c
SimHash 60493b55d941

Groups

*

Rule Path
Disallow /*?id=*
Disallow /*%26id%3D*
Disallow /*?L=0*
Disallow /*%26L%3D0*
Disallow /*/Private/*
Disallow /*/Configuration/*
Disallow /typo3temp/*
Allow /typo3temp/*.css$
Allow /typo3temp/*.css.*.gzip$
Allow /typo3temp/*.js$
Allow /typo3temp/*.js.*.gzip$
Allow /typo3temp/*.jpg$
Allow /typo3temp/*.gif$
Allow /typo3temp/*.png$
Allow /typo3temp/*.pdf$
Disallow *.sql
Disallow *.sql.gz
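
The rule table above mixes Disallow and Allow patterns with `*` wildcards and `$` end anchors, so it may help to see how a crawler could resolve them. The following is a minimal sketch, assuming the common longest-match convention (the longest matching pattern wins, and Allow wins a tie). The names `RULES`, `pattern_to_regex` and `is_allowed` are hypothetical and not part of the scan report. The sketch compares raw path strings and does not normalize percent-encoding, and real crawlers may treat patterns that do not start with `/` (such as `*.sql`) differently.

```python
import re

# Rules copied from the "*" group above, in file order.
RULES = [
    ("disallow", "/*?id=*"),
    ("disallow", "/*%26id%3D*"),
    ("disallow", "/*?L=0*"),
    ("disallow", "/*%26L%3D0*"),
    ("disallow", "/*/Private/*"),
    ("disallow", "/*/Configuration/*"),
    ("disallow", "/typo3temp/*"),
    ("allow", "/typo3temp/*.css$"),
    ("allow", "/typo3temp/*.css.*.gzip$"),
    ("allow", "/typo3temp/*.js$"),
    ("allow", "/typo3temp/*.js.*.gzip$"),
    ("allow", "/typo3temp/*.jpg$"),
    ("allow", "/typo3temp/*.gif$"),
    ("allow", "/typo3temp/*.png$"),
    ("allow", "/typo3temp/*.pdf$"),
    ("disallow", "*.sql"),
    ("disallow", "*.sql.gz"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with '.*' for each '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under the given rules."""
    best = None  # (pattern length, is_allow) of the best match so far
    for kind, pattern in rules:
        # re.match anchors at the start of the path, like a prefix match.
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            # Longer pattern wins; on equal length, allow beats disallow.
            if best is None or candidate > best:
                best = candidate
    # No matching rule means the path is allowed by default.
    return True if best is None else best[1]


if __name__ == "__main__":
    for path in ("/typo3temp/assets/style.css",
                 "/typo3temp/cache/foo.php",
                 "/index.php?id=5",
                 "/about/"):
        print(path, is_allowed(path))
```

Under these assumptions, stylesheets below `/typo3temp/` stay crawlable because the Allow pattern is longer than the blanket `/typo3temp/*` Disallow, while other files in that directory remain blocked.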

Other Records

Field Value
sitemap https://www.example.com/sitemap.xml

Comments

  • Specialized robots.txt for TYPO3
  • See: https://gist.github.com/oliverthiele/83957820413fd981e062
  • Only allow URLs generated with RealURL
  • L=0 is the default language
  • Should always be protected (.htaccess)
  • Disallow temporary files
  • Disallow SQL files
  • Sitemap (Cannot be relative path)
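
The last comment notes that the sitemap entry cannot be a relative path, and the recorded sitemap value points at www.example.com rather than the scanned domain, which may be a leftover from the template. A minimal sketch of a check for both conditions follows. The function name `check_sitemap` and its return format are hypothetical, and a sitemap hosted on another domain can be legitimate, so the host check is only a hint.

```python
from urllib.parse import urlsplit


def check_sitemap(sitemap_url, base_domain):
    """Return a list of potential problems with a robots.txt sitemap value."""
    parts = urlsplit(sitemap_url)
    problems = []
    # A sitemap value needs a scheme and a host to be an absolute URL.
    if parts.scheme not in ("http", "https") or not parts.netloc:
        problems.append("not an absolute http(s) URL")
    host = parts.hostname or ""
    # Flag hosts that are neither the base domain nor one of its subdomains.
    if host and not (host == base_domain or host.endswith("." + base_domain)):
        problems.append(f"host {host} is outside {base_domain}")
    return problems


if __name__ == "__main__":
    print(check_sitemap("https://www.example.com/sitemap.xml", "wir-sagen-ja.com"))
    print(check_sitemap("/sitemap.xml", "wir-sagen-ja.com"))
    print(check_sitemap("https://www.wir-sagen-ja.com/sitemap.xml", "wir-sagen-ja.com"))
```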