hhu.de
robots.txt

Robots Exclusion Standard data for hhu.de

Resource Scan

Scan Details

Site Domain hhu.de
Base Domain hhu.de
Scan Status Ok
Last Scan2024-10-30T19:58:45+00:00
Next Scan 2024-11-29T19:58:45+00:00

Last Scan

Scanned2024-10-30T19:58:45+00:00
URL https://hhu.de/robots.txt
Redirect https://www.hhu.de/robots.txt
Redirect Domain www.hhu.de
Redirect Base hhu.de
Domain IPs 134.99.128.238
Redirect IPs 134.99.128.238
Response IP 134.99.128.238
Found Yes
Hash 93439df4329c7061727b37e7431b7afbca830996685c6728e1468f03109838f3
SimHash 659d5854da00

Groups

ahrefsbot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

yacybot

Rule Path
Allow /VORSCHAU/*

bingbot

Rule Path
Disallow /die-hhu/kontakt-und-services/zentrale-und-amtliche-bekanntmachungen/*
Disallow /en/about-hhu/contact-and-services/central-and-official-announcements/*

gptbot

Rule Path
Disallow /die-hhu/kontakt-und-services/zentrale-und-amtliche-bekanntmachungen/*
Disallow /en/about-hhu/contact-and-services/central-and-official-announcements/*

*

Rule Path
Disallow /VORSCHAU/*
Disallow /ARCHIV/
Disallow /blochproxy/
Disallow /*?id=*
Disallow /*%26id%3D*
Disallow /*?L=0*
Disallow /*%26L%3D0*
Disallow /*?type=98*
Disallow /*%26type%3D98*
Disallow /*?type=9818*
Disallow /*%26type%3D9818*
Disallow /*?type=151*
Disallow /*%26type%3D151*
Disallow /*/Private/*
Disallow /fileadmin/templates/html/*
Disallow /*/Configuration/*
Disallow /typo3temp/*
Allow /typo3temp/*.css$
Allow /typo3temp/*.css.*.gzip$
Allow /typo3temp/*.js$
Allow /typo3temp/*.js.*.gzip$
Allow /typo3temp/*.jpg$
Allow /typo3temp/*.gif$
Allow /typo3temp/*.png$
Disallow *.sql
Disallow *.sql.gz

Other Records

Field Value
crawl-delay 10

Comments

  • Only allow URLs generated with RealURL
  • L=0 is the default language
  • typeNum = 98 is usually the print version.
  • Should always be protected (.htaccess)
  • nfs v9