www.geschkult.fu-berlin.de
robots.txt

Robots Exclusion Standard data for www.geschkult.fu-berlin.de

Archived Snapshots

Resource Scan

Scan Details

Site Domain	www.geschkult.fu-berlin.de
Base Domain	fu-berlin.de
Scan Status	Ok
Last Scan	2025-10-16T15:08:18+00:00
Next Scan	2025-11-15T15:08:18+00:00

Last Scan

Scanned	2025-10-16T15:08:18+00:00
URL	https://www.geschkult.fu-berlin.de/robots.txt
Domain IPs	160.45.170.14
Response IP	160.45.170.14
Found	Yes
Hash	fe9a88ccbb5e54d7394ace8bb8991d189f6dcf930978a5c77ea8fee4c3cfc142
SimHash	d6190d0de554

Groups

*

Rule	Path
Disallow	/_search
Disallow	/en/_search
Disallow	/fr/_search
Disallow	/es/_search
Disallow	/pt/_search
Disallow	/zh/_search
Disallow	/e/fmi/bereiche/ab_nolte/Projekte/hoer-wissen-im-wandel/_search
Disallow	/en/e/fmi/bereiche/ab_nolte/Projekte/hoer-wissen-im-wandel/_search
Disallow	/e/tongilbu/_search
Disallow	/e/provenienzforschung/_search
Disallow	/e/praehist/ella/_search
Disallow	/ar/e/semiarab/_search
Disallow	/e/oas/sinologie/forschung/Forschungsschwerpunkte/Linguistik/PM-Ressourcenpool-ChaF/_search
Disallow	/e/oas/sinologie/forschung/projekte/bibliochaf/_search

Rule

Path

Disallow

/_search

Disallow

/en/_search

Disallow

/fr/_search

Disallow

/es/_search

Disallow

/pt/_search

Disallow

/zh/_search

Disallow

/e/fmi/bereiche/ab_nolte/Projekte/hoer-wissen-im-wandel/_search

Disallow

/en/e/fmi/bereiche/ab_nolte/Projekte/hoer-wissen-im-wandel/_search

Disallow

/e/tongilbu/_search

Disallow

/e/provenienzforschung/_search

Disallow

/e/praehist/ella/_search

Disallow

/ar/e/semiarab/_search

Disallow

/e/oas/sinologie/forschung/Forschungsschwerpunkte/Linguistik/PM-Ressourcenpool-ChaF/_search

Disallow

/e/oas/sinologie/forschung/projekte/bibliochaf/_search

Back to top

Other Records

Field	Value
sitemap	https://www.geschkult.fu-berlin.de/sitemap.xml

Field

Value

sitemap

https://www.geschkult.fu-berlin.de/sitemap.xml

Back to top

Comments

See http://www.robotstxt.org/wc/norobots.html for documentation on how to use the robots.txt file
To ban all spiders from the entire site uncomment the next two lines:
User-Agent: *
Disallow: /
site: geschkult

Back to top

www.geschkult.fu-berlin.derobots.txt

Resource Scan

Scan Details

Last Scan

Groups

*

Other Records

Comments

www.geschkult.fu-berlin.de
robots.txt