umweltdatenbank.de
robots.txt

Robots Exclusion Standard data for umweltdatenbank.de

Resource Scan

Scan Details

Site Domain umweltdatenbank.de
Base Domain umweltdatenbank.de
Scan Status Ok
Last Scan2024-09-30T22:35:16+00:00
Next Scan 2024-10-07T22:35:16+00:00

Last Scan

Scanned2024-09-30T22:35:16+00:00
URL https://umweltdatenbank.de/robots.txt
Redirect https://www.umweltdatenbank.de/robots.txt
Redirect Domain www.umweltdatenbank.de
Redirect Base umweltdatenbank.de
Domain IPs 85.13.138.191
Redirect IPs 85.13.138.191
Response IP 85.13.138.191
Found Yes
Hash 6b077b20b9f8ac8a02b7a6175a3ca6cdf037de830058711bc66c10d30c12918b
SimHash c1a1c7c3a6a1

Groups

mediapartners-google*

Rule Path
Disallow

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

mj12bot

Rule Path
Disallow

dotbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

*

Rule Path
Disallow /cgi-bin/
Disallow /cgi-local/
Disallow */drucken.html
Disallow */print.html
Disallow *mailto*
Disallow */*.pdf
Disallow */rss.html
Disallow */stat.php
Disallow */lexikon-weiterleitungen/
Disallow */glossary-redirections/
Disallow /cms/administrator/
Disallow /cms/bin/
Disallow /cms/cache/
Disallow /cms/components/
Disallow /cms/cli/
Disallow /cms/includes/
Disallow /cms/installation/
Disallow /cms/language/
Disallow /cms/layouts/
Disallow /cms/libraries/
Disallow /cms/logs/
Disallow /cms/modules/
Disallow /cms/plugins/
Disallow /cms/tmp/
Disallow /cms/xmlrpc/
Disallow /cms/lexikon/lexikon
Disallow /tpl/*.php
Disallow /cms/component/
Disallow fw.php*

Comments

  • robots.txt Version 3.8.3 UDB
  • User-Agent: MJ12bot
  • Crawl-Delay: 20