hwk-koeln.de
robots.txt

Robots Exclusion Standard data for hwk-koeln.de

Resource Scan

Scan Details

Site Domain hwk-koeln.de
Base Domain hwk-koeln.de
Scan Status Ok
Last Scan 2026-02-24T20:24:22+00:00
Next Scan 2026-03-10T20:24:22+00:00

Last Scan

Scanned 2026-02-24T20:24:22+00:00
URL https://hwk-koeln.de/robots.txt
Redirect https://www.hwk-koeln.de/robots.txt
Redirect Domain www.hwk-koeln.de
Redirect Base hwk-koeln.de
Domain IPs 176.95.69.164
Redirect IPs 176.95.69.164
Response IP 176.95.69.164
Found Yes
Hash 8296db57fd246ec61856f3c09c3e6035f37243c52d7cae33234bd8c88ef74fc1
SimHash 20a5dd407790

Groups

googlebot
adsbot-google
googlebot-image

Rule Path
Disallow (empty)
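
An empty Disallow value places no restriction on the listed crawlers. The following is a minimal sketch of that behaviour using Python's standard urllib.robotparser. The group lines are an assumed reconstruction from the table above (the report does not show the file's original casing or line order), and the URL being checked is only an illustration.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of the first group; the report lists only the
# parsed user agents and the single empty Disallow rule.
GROUP_LINES = [
    "User-agent: googlebot",
    "User-agent: adsbot-google",
    "User-agent: googlebot-image",
    "Disallow:",
]

parser = RobotFileParser()
parser.parse(GROUP_LINES)

# Illustrative URL; any path on the site would give the same answer
# for these crawlers, because an empty Disallow blocks nothing.
check_url = "https://www.hwk-koeln.de/downloads/"
googlebot_allowed = parser.can_fetch("Googlebot", check_url)
adsbot_allowed = parser.can_fetch("AdsBot-Google", check_url)
```

Both checks should come back as allowed, even for a path that the `*` group disallows, because a crawler that matches a named group follows that group rather than the catch-all one.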

*

Rule Path
Disallow /downloads/
Disallow /export/
Disallow /ftp/
Disallow /images/
Disallow /metrics/
Disallow /scripts/
Disallow /style/
Disallow /temp/
Disallow /tmp/
Disallow /upload/
Disallow /*?op=print$
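
The last rule uses the `*` wildcard and the `$` end anchor, which Python's urllib.robotparser treats as literal characters. As a rough sketch of how these rules could be evaluated, the helper below translates each path into a regular expression, assuming the common convention that `*` matches any run of characters and a trailing `$` anchors the end of the URL. The function names and the sample paths are hypothetical and not part of the report.

```python
import re

# Disallow paths of the "*" group, copied from the table above.
DISALLOW_RULES = [
    "/downloads/", "/export/", "/ftp/", "/images/", "/metrics/",
    "/scripts/", "/style/", "/temp/", "/tmp/", "/upload/",
    "/*?op=print$",
]

def rule_to_regex(rule_path: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex."""
    anchored = rule_path.endswith("$")
    body = rule_path[:-1] if anchored else rule_path
    # Escape each literal piece, then rejoin the pieces with '.*' for '*'.
    regex = ".*".join(re.escape(part) for part in body.split("*"))
    return re.compile(regex + (r"\Z" if anchored else ""))

def is_disallowed(path_and_query: str) -> bool:
    """Return True if any Disallow rule matches from the start of the path."""
    return any(rule_to_regex(rule).match(path_and_query) for rule in DISALLOW_RULES)

# Hypothetical paths, used only to exercise the rules.
print(is_disallowed("/downloads/form.pdf"))       # plain prefix rule
print(is_disallowed("/seite?op=print"))           # wildcard with end anchor
print(is_disallowed("/seite?op=print&lang=en"))   # anchor prevents a match
```

Because this group contains only Disallow rules, no precedence between Allow and Disallow has to be resolved; a file that mixes both would need a longest-match comparison on top of this.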

Other Records

Field Value
sitemap https://www.hwk-koeln.de/sitemap.xml
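
A sitemap record is independent of the user-agent groups, so a parser can collect it regardless of which crawler is asking. A small sketch with urllib.robotparser (the `site_maps()` method requires Python 3.8 or later) follows. The input line is an assumed reconstruction of the record shown above, not a copy of the original file.

```python
from urllib.robotparser import RobotFileParser

# Assumed form of the record in the original file.
RECORD_LINES = ["Sitemap: https://www.hwk-koeln.de/sitemap.xml"]

parser = RobotFileParser()
parser.parse(RECORD_LINES)

# site_maps() returns a list of sitemap URLs, or None if there are none.
sitemaps = parser.site_maps()
```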