um-mainz.de
robots.txt

Robots Exclusion Standard data for um-mainz.de

Resource Scan

Scan Details

Site Domain um-mainz.de
Base Domain um-mainz.de
Scan Status Ok
Last Scan2025-11-15T10:22:31+00:00
Next Scan 2025-12-15T10:22:31+00:00

Last Scan

Scanned2025-11-15T10:22:31+00:00
URL https://www.um-mainz.de/robots.txt
Domain IPs 134.93.123.60
Response IP 134.93.123.60
Found Yes
Hash 72c8bf1d07642b70387347e036102a981595d8a66a6d940790e1db36230cce80
SimHash 6108635f1377

Groups

gptbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

claude*

Rule Path
Disallow /

perplexity*

Rule Path
Disallow /

anthropic-ai

Rule Path
Disallow /

cohere*

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /

meta-externalagent
meta-externalfetcher

Rule Path
Disallow /

mistralai*

Rule Path
Disallow /

bigsur.ai

Rule Path
Disallow /

awario
echobot bot
echoboxbot
semrushbot*

Rule Path
Disallow /

bytespider
diffbot
firecrawlagent
aihitbot

Rule Path
Disallow /

*

Rule Path
Disallow *?*type=98
Disallow /botfalle/
Disallow /hilfe_remotezugang/
Disallow /typo3/
Disallow /typo3_src/
Disallow /typo3conf/
Disallow /typo3temp/
Disallow /xmlrpc.php
Disallow /wp-admin/

googlebot-image

Rule Path
Disallow /
Allow /fileadmin/vorlagen/
Allow /fileadmin/vorlagen/portal/aerzteblatt/
Allow /fileadmin/vorlagen/portal/downloads/
Allow /fileadmin/vorlagen/portal/lageplan/
Allow /fileadmin/vorlagen/portal/downloads/presse/
Allow /fileadmin/vorlagen/portal/downloads/veranstaltungen/
Allow /fileadmin/kliniken/portal/inside-um/

Warnings

  • `noindex` is not a known field.