bau.de
robots.txt

Robots Exclusion Standard data for bau.de

Resource Scan

Scan Details

Site Domain bau.de
Base Domain bau.de
Scan Status Ok
Last Scan 2026-01-09T11:50:17+00:00
Next Scan 2026-01-16T11:50:17+00:00

Last Scan

Scanned 2026-01-09T11:50:17+00:00
URL https://bau.de/robots.txt
Domain IPs 2001:8d8:100f:f000::2c5, 217.160.0.42
Response IP 217.160.0.42
Found Yes
Hash 8c3e1ef48bc1589cf1083612772b2e8120bb1acfc3b47bc1a0a14816107af27c
SimHash 6c0958217394
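
The Hash value above is 64 hexadecimal digits long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. Assuming SHA-256 over the raw response body, a change check could be sketched as follows. The function name robots_digest is hypothetical, and the SimHash field is not reproduced here because its exact scheme is not stated.

```python
import hashlib


def robots_digest(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt response body.

    Assumption: the scanner hashes the raw bytes with SHA-256.
    The report itself does not confirm the algorithm.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_digest: str, body: bytes) -> bool:
    """Compare a stored digest with the digest of a freshly fetched body."""
    return robots_digest(body) != previous_digest
```

Comparing the stored digest with the digest of a new fetch only shows whether the file changed at all, not how much it changed.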

Groups

*

Rule Path Comment
Disallow /tmp/ -
Disallow /a.php -
Disallow /e.php -
Disallow /ie.php -
Disallow /impressum.php -
Disallow /allgemein/ -
Allow /allgemein/partner.php -
Disallow /suche/ -
Disallow /pgm/ -
Disallow /pgm/forum/edit.php -
Allow /*thema-*?* -
Disallow /*?* -
Allow /* :~:text=*
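
Because the group mixes Allow and Disallow rules with wildcards, the outcome for a given URL depends on rule precedence. The sketch below evaluates the rules listed above under the longest-match convention described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie). The names RULES, pattern_to_regex and is_allowed are hypothetical, and individual crawlers may resolve conflicts differently.

```python
import re

# Rules copied from the "*" group above, in file order.
RULES = [
    ("disallow", "/tmp/"),
    ("disallow", "/a.php"),
    ("disallow", "/e.php"),
    ("disallow", "/ie.php"),
    ("disallow", "/impressum.php"),
    ("disallow", "/allgemein/"),
    ("allow", "/allgemein/partner.php"),
    ("disallow", "/suche/"),
    ("disallow", "/pgm/"),
    ("disallow", "/pgm/forum/edit.php"),
    ("allow", "/*thema-*?*"),
    ("disallow", "/*?*"),
    ("allow", "/*"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a prefix-anchored regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # "*" matches any run of characters; everything else is literal.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Return True if the longest matching rule is an Allow.

    A path that matches no rule is treated as allowed.
    """
    best_len, best_kind = -1, "allow"
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len, best_kind = length, kind
    return best_kind == "allow"


for path in ("/allgemein/partner.php", "/forum/thema-123.php?page=2", "/index.php?x=1"):
    print(path, is_allowed(path))
```

Under this convention the pattern /*thema-*?* (11 characters) also outranks the shorter /pgm/ rule, so a "thema-" URL with a query string below /pgm/ would count as allowed in this sketch.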

Comments

  • robots.txt - Crawl control for bots
  • ============================
  • Default rule for all other bots
  • ============================
  • Block special areas
  • Exception: URLs containing "thema-" may include parameters
  • Block all other URLs with parameters
  • Allow text fragments (Chrome feature for text highlighting)
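
The last row of the rule table shows the path /* with the comment :~:text=*. One plausible explanation is that the raw line contains a "#" before the text-fragment directive, since "#" starts a comment in robots.txt. The sketch below shows how such a line would be split. The raw line used in the example is a hypothetical reconstruction, not copied from the file, and split_rule_line is an illustrative name.

```python
def split_rule_line(line):
    """Split one robots.txt line into (field, value, comment).

    Everything after the first "#" is treated as a comment.
    Lines without a "field: value" part return None for field and value.
    """
    content, sep, comment = line.partition("#")
    comment = comment.strip() if sep else None
    field, colon, value = content.partition(":")
    if not colon:
        return None, None, comment
    return field.strip().lower(), value.strip(), comment


# Hypothetical raw line; the scan report only shows the parsed result.
print(split_rule_line("Allow: /*#:~:text=*"))
```

If the raw line does look like this, the effective rule is simply Allow: /*, and the text-fragment part has no effect on matching.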