Robots Exclusion Standard data for fu-berlin.de

Resource Scan

Scan Details

Site Domain fu-berlin.de
Base Domain fu-berlin.de
Scan Status Ok
Last Scan 2024-11-05T16:31:47+00:00
Next Scan 2024-11-19T16:31:47+00:00

Last Scan

Scanned 2024-11-05T16:31:47+00:00
URL https://fu-berlin.de/robots.txt
Redirect https://www.fu-berlin.de/robots.txt
Redirect Domain www.fu-berlin.de
Redirect Base fu-berlin.de
Domain IPs 160.45.170.10
Redirect IPs 160.45.170.10
Response IP 160.45.170.10
Found Yes
Hash deddbc549ca3d6190db277dec19610882efa5a19890125b11242eff6cf6e837e
SimHash b38d0c8fe4f1
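
The Hash value above is 64 hexadecimal digits long, which is consistent with a SHA-256 digest of the fetched file, although the report does not name the algorithm. The following is a minimal sketch, assuming SHA-256 over the raw response body, of how such a fingerprint could be used to detect changes between scans. The helper names `sha256_hex` and `has_changed` and the sample body are hypothetical; the sample is not the real file, so its digest will not match the value listed above.

```python
import hashlib


def sha256_hex(body: bytes) -> str:
    """Return the SHA-256 digest of a response body as lowercase hex."""
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_digest: str, body: bytes) -> bool:
    """Report whether a newly fetched body differs from the stored digest."""
    return sha256_hex(body) != previous_digest


# Hypothetical sample body, NOT the actual robots.txt of fu-berlin.de.
sample_body = b"User-agent: *\nDisallow: /_search\n"
stored = sha256_hex(sample_body)

print(len(stored))                                  # digest length in hex digits
print(has_changed(stored, sample_body))             # same bytes
print(has_changed(stored, sample_body + b"# x\n"))  # one extra comment line
```

An exact hash changes on any byte-level difference, which is presumably why the report lists a SimHash as well: a similarity hash is meant to stay close for near-identical files.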

Groups

*

Rule Path
Disallow /_search
Disallow /en/_search
Disallow /campusleben/_search
Disallow /sites/digitale-lehre/_search
Disallow /sites/weiterbildung/_search
Disallow /universitaet/beruf-karriere/_search
Disallow /en/sites/drs/_search
Disallow /sites/drs/_search
Disallow /sites/nachhaltigkeit/handlungsfelder/campus/verwertung_entsorgung/fundgrube/_search
Disallow /sites/forschungsdatenmanagement/_search
Disallow /sites/genome-editing/_search
Disallow /sites/nachhaltigkeit/_search
Disallow /sites/nachhaltigkeit/handlungsfelder/campus/verwertung_entsorgung/_search
Disallow /en/sites/forschungsdatenmanagement/_search
Disallow /sites/eventn/_search
Disallow /sites/PM-it-dienstekatalog/_search
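
As a rough illustration of how a crawler might apply the rules above, the sketch below feeds a few of the listed Disallow paths to Python's standard `urllib.robotparser` and checks two URLs. The user agent name `ExampleBot` and the tested URLs are assumptions for illustration, and the rule text is reconstructed from this listing rather than copied from the live file.

```python
from urllib.robotparser import RobotFileParser

# A subset of the "*" group, reconstructed from the listing above.
ROBOTS_TXT = """\
User-agent: *
Disallow: /_search
Disallow: /en/_search
Disallow: /campusleben/_search
Disallow: /sites/drs/_search
"""

BASE = "https://www.fu-berlin.de"


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without fetching anything over the network."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# "ExampleBot" is a hypothetical crawler name; it falls under the "*" group.
blocked = parser.can_fetch("ExampleBot", BASE + "/en/_search?q=test")
allowed = parser.can_fetch("ExampleBot", BASE + "/en/index.html")

print(blocked)  # search endpoint under /en/
print(allowed)  # ordinary content page
```

`urllib.robotparser` matches rules by path prefix, so each entry covers the search endpoint itself and anything beneath it, including query strings.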

Other Records

Field Value
sitemap https://www.fu-berlin.de/sitemap.xml
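
The sitemap record above can be read with the same standard-library parser. This is a minimal sketch, assuming Python 3.8 or later (where `RobotFileParser.site_maps()` is available). The position of the Sitemap line within the file is an assumption, since the report lists only the field and its value.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed fragment; the real file contains more rules than shown here.
ROBOTS_TXT = """\
User-agent: *
Disallow: /_search

Sitemap: https://www.fu-berlin.de/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

# site_maps() returns None when no Sitemap record is present,
# so fall back to an empty list before iterating.
sitemaps = parser.site_maps() or []
for url in sitemaps:
    print(url)
```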

Comments

  • See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines:
  • User-Agent: *
  • Disallow: /
  • site: fu-berlin
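
The comments above describe two lines that would ban all spiders if uncommented. The sketch below compares the two states with `urllib.robotparser`. The helper `is_allowed`, the agent name `ExampleBot`, and the test URL are hypothetical, and the two strings model only the lines quoted in the comments, not the whole file.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt: str, url: str, agent: str = "ExampleBot") -> bool:
    """Check a URL against robots.txt text supplied as a string."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)


# The two lines as comments: the parser ignores them.
COMMENTED = "# User-Agent: *\n# Disallow: /\n"
# The same two lines uncommented: every path is disallowed for all agents.
UNCOMMENTED = "User-Agent: *\nDisallow: /\n"

URL = "https://www.fu-berlin.de/en/index.html"

print(is_allowed(COMMENTED, URL))
print(is_allowed(UNCOMMENTED, URL))
```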