h-da.de
robots.txt

Robots Exclusion Standard data for h-da.de

Resource Scan

Scan Details

Site Domain h-da.de
Base Domain h-da.de
Scan Status Ok
Last Scan2024-10-28T16:55:00+00:00
Next Scan 2024-11-27T16:55:00+00:00

Last Scan

Scanned2024-10-28T16:55:00+00:00
URL https://h-da.de/robots.txt
Domain IPs 141.100.10.111, 2001:67c:2184:fdfe::111
Response IP 141.100.10.111
Found Yes
Hash 10873aa9a8e75c21124969836eacdbefd5a39846349b02e5bf612e6b67919ada
SimHash 2598630e8f48

Groups

*

Rule Path Comment
Allow / -
Allow /typo3conf/ext/ -
Allow /typo3temp/ -
Disallow /typo3/ -
Disallow /typo3temp/* -
Allow /typo3temp/*.css -
Allow /typo3temp/*.css.*.gzip -
Allow /typo3temp/*.js -
Allow /typo3temp/*.js.*.gzip -
Allow /typo3temp/*.jpg -
Allow /typo3temp/*.gif -
Allow /typo3temp/*.png -
Disallow /*?L=0* -
Disallow /*%26L%3D0* -
Disallow /*?id=* non speaking URLs
Disallow /*%26id%3D* -
Disallow /*?type=98* -
Disallow /*%26type%3D98* -
Disallow /*tx_powermail_pi1 no powermail thanks pages

Comments

  • folders
  • L=0 is the default language
  • parameters