htwk-leipzig.de
robots.txt

Robots Exclusion Standard data for htwk-leipzig.de

Resource Scan

Scan Details

Site Domain htwk-leipzig.de
Base Domain htwk-leipzig.de
Scan Status Ok
Last Scan2025-12-05T02:35:07+00:00
Next Scan 2026-01-04T02:35:07+00:00

Last Scan

Scanned2025-12-05T02:35:07+00:00
URL https://htwk-leipzig.de/robots.txt
Redirect https://www.htwk-leipzig.de/robots.txt
Redirect Domain www.htwk-leipzig.de
Redirect Base htwk-leipzig.de
Domain IPs 141.57.5.215
Redirect IPs 141.57.5.215
Response IP 141.57.5.215
Found Yes
Hash 87fbd9aac374a3f90aa427634bf4fa219b8ba9b3fab462f72749a22b792a6a31
SimHash 6619da15ce49

Groups

bytespider

Rule Path
Disallow /

*

Rule Path
Allow /
Disallow /fileadmin/_temp_
Disallow /fileadmin/portal/intranet/
Disallow /fileadmin/user_upload/
Disallow /typo3conf/
Disallow /typo3temp/
Disallow /typo3/
Disallow /piwik/
Disallow /*?id=*
Disallow /*%26id%3D*
Disallow /*?L=0*
Disallow /*%26L%3D0*
Disallow /*?type=98*
Disallow /*%26type%3D98*
Disallow /*?cHash*
Disallow /*?tx_powermail_pi1*
Disallow /*?tx_ttnews

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://stura.htwk-leipzig.de/sitemapindex.xml

Comments

  • Only allow URLs generated with RealURL
  • L=0 is the default language
  • typeNum = 98 is usually the print version.