www.mgt.tum.de
robots.txt

Robots Exclusion Standard data for www.mgt.tum.de

Resource Scan

Scan Details

Site Domain www.mgt.tum.de
Base Domain tum.de
Scan Status Ok
Last Scan2025-09-12T13:59:13+00:00
Next Scan 2025-10-12T13:59:13+00:00

Last Scan

Scanned2025-09-12T13:59:13+00:00
URL https://www.mgt.tum.de/robots.txt
Domain IPs 138.246.224.229, 2001:4ca0:800::8af6:e0e5
Response IP 138.246.224.229
Found Yes
Hash e689884339b032a0983e8e78a7d0028f300796aff7c71f1223b6df32e3d4cb10
SimHash a9484152dbd1

Groups

*

Rule Path Comment
Allow / -
Disallow /typo3/ -
Disallow /typo3conf/ -
Allow /_assets/ -
Allow /typo3temp/ -
Disallow /search/* -
Disallow /*tx_form_formframework no forms

Other Records

Field Value
sitemap https://www.mgt.tum.de/sitemap.xml
sitemap https://www.mgt.tum.de/sitemap.xml?sitemap=pages&cHash=85b35d5df31dece3b4ca15ac7a5671d3
sitemap https://www.mgt.tum.de/sitemap.xml?sitemap=news&cHash=7a6e0800b7705eefdbe30d37cd25f357

Comments

  • folders
  • Don't index the search folder
  • parameters
  • sitemap