web.fu-berlin.de
robots.txt

Robots Exclusion Standard data for web.fu-berlin.de

Resource Scan

Scan Details

Site Domain web.fu-berlin.de
Base Domain fu-berlin.de
Scan Status Ok
Last Scan2025-09-24T10:04:37+00:00
Next Scan 2025-10-08T10:04:37+00:00

Last Scan

Scanned2025-09-24T10:04:37+00:00
URL https://web.fu-berlin.de/robots.txt
Redirect https://www.fu-berlin.de/robots.txt
Redirect Domain www.fu-berlin.de
Redirect Base fu-berlin.de
Domain IPs 130.133.4.198
Redirect IPs 160.45.170.10
Response IP 160.45.170.10
Found Yes
Hash e67a3519b5662ab1eb4c98e7044822e5b712e2d13c8f127c727229f608e202ef
SimHash b3850d8fe471

Groups

*

Rule Path
Disallow /_search
Disallow /en/_search
Disallow /campusleben/_search
Disallow /sites/digitale-lehre/_search
Disallow /sites/weiterbildung/_search
Disallow /en/sites/drs/_search
Disallow /sites/drs/_search
Disallow /sites/nachhaltigkeit/handlungsfelder/campus/verwertung_entsorgung/fundgrube/_search
Disallow /sites/forschungsdatenmanagement/_search
Disallow /sites/genome-editing/_search
Disallow /sites/nachhaltigkeit/_search
Disallow /sites/nachhaltigkeit/handlungsfelder/campus/verwertung_entsorgung/_search
Disallow /en/sites/forschungsdatenmanagement/_search
Disallow /sites/eventn/_search
Disallow /sites/PM-it-dienstekatalog/_search

Other Records

Field Value
sitemap https://www.fu-berlin.de/sitemap.xml

Comments

  • See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines:
  • User-Agent: *
  • Disallow: /
  • site: fu-berlin