allerwelt-lexikon.de
robots.txt

Robots Exclusion Standard data for allerwelt-lexikon.de

Resource Scan

Scan Details

Site Domain allerwelt-lexikon.de
Base Domain allerwelt-lexikon.de
Scan Status Ok
Last Scan2024-09-29T10:43:08+00:00
Next Scan 2024-10-06T10:43:08+00:00

Last Scan

Scanned2024-09-29T10:43:08+00:00
URL https://allerwelt-lexikon.de/robots.txt
Redirect https://www.allerwelt-lexikon.de/robots.txt
Redirect Domain www.allerwelt-lexikon.de
Redirect Base allerwelt-lexikon.de
Domain IPs 188.40.73.119
Redirect IPs 188.40.73.119
Response IP 188.40.73.119
Found Yes
Hash b72d2b7b39627fc20879542dcdd41d48b08ce966f14afd7604a470be6bf56f26
SimHash c5a1c5c3a6a1

Groups

mediapartners-google*

Rule Path
Disallow

*

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

mj12bot

Rule Path
Disallow

dotbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

slurp

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

*

Rule Path
Disallow /cgi-bin/
Disallow /cgi-local/
Disallow */drucken.html
Disallow */print.html
Disallow *mailto*
Disallow */*.pdf
Disallow */rss.html
Disallow */stat.php
Disallow */lexikon-weiterleitungen/
Disallow */glossary-redirections/
Disallow /cms/administrator/
Disallow /cms/bin/
Disallow /cms/cache/
Disallow /cms/components/
Disallow /cms/cli/
Disallow /cms/includes/
Disallow /cms/installation/
Disallow /cms/language/
Disallow /cms/layouts/
Disallow /cms/libraries/
Disallow /cms/logs/
Disallow /cms/modules/
Disallow /cms/plugins/
Disallow /cms/tmp/
Disallow /cms/xmlrpc/
Disallow /cms/lexikon/lexikon
Disallow /tpl/*.php

Comments

  • robots.txt Version 3.8.3 AW
  • User-Agent: MJ12bot
  • Crawl-Delay: 20