kulturbox.de
robots.txt

Robots Exclusion Standard data for kulturbox.de

Resource Scan

Scan Details

Site Domain kulturbox.de
Base Domain kulturbox.de
Scan Status Ok
Last Scan2024-09-16T15:56:17+00:00
Next Scan 2024-10-16T15:56:17+00:00

Last Scan

Scanned2024-09-16T15:56:17+00:00
URL https://kulturbox.de/robots.txt
Redirect https://www.kulturbox.de/robots.txt
Redirect Domain www.kulturbox.de
Redirect Base kulturbox.de
Domain IPs 217.115.145.95
Redirect IPs 217.115.145.95
Response IP 217.115.145.95
Found Yes
Hash 2b88b3e01677411732fcf07fb4d22f36edd155c1a5e4cd7bef077d9056266a62
SimHash fb3408c3c152

Groups

*

Rule Path
Disallow */wc.dll

www.integromedb.org/crawler

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

aboutusbot

Rule Path
Disallow /

awariorssbot

Rule Path
Disallow /

awariosmartbot

Rule Path
Disallow /

academicbotrtu

Rule Path
Disallow /

admantx-euastn

Rule Path
Disallow /

adsbot

Rule Path
Disallow /

adstxtlab.com crawler

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

amazonbot

Rule Path
Disallow /buehnen/
Disallow /museen/
Disallow /galerien/
Disallow /kuenstler/
Disallow /meine-kulturbox/
Disallow /wc.dll

Other Records

Field Value
crawl-delay 3600

archive.org_bot

Rule Path
Disallow /

archiveteam archivebot

Rule Path
Disallow /

auskunftbot/1.0

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

backlinkcrawler

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

bubing

Rule Path
Disallow /

buibui-bot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

coccocbot-web

Rule Path
Disallow /

crawler_eb_germany

Rule Path
Disallow /

dingobot

Rule Path
Disallow /

seznambot/3.2

Rule Path
Disallow /

xaxissemanticsclassifier/1.0

Rule Path
Disallow /

lcc

Rule Path
Disallow /

daum/4.1

Rule Path
Disallow /

dataprovider

Rule Path
Disallow /

eright/1.0

Rule Path
Disallow /

eright

Rule Path
Disallow /

neevabot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

tweetmemebot/4.0

Rule Path
Disallow /

tweetmemebot

Rule Path
Disallow /

domain re-animator bot

Rule Path
Disallow /

domaincrawler/3.0

Rule Path
Disallow /

domainstatsbot

Rule Path
Disallow /

dataforseobot

Rule Path
Disallow /

imagesiftbot

Rule Path
Disallow /

electricmonk

Rule Path
Disallow /

mindupbot

Rule Path
Disallow /

exabot/3.0

Rule Path
Disallow /

extlinksbot/1.5

Rule Path
Disallow /

extlinksbot

Rule Path
Disallow /

garlikcrawler/1.2

Rule Path
Disallow /

domainappender /1.0

Rule Path
Disallow /

hybridbot

Rule Path
Disallow /

lcc

Rule Path
Disallow /

spiderling

Rule Path
Disallow /

linkpadbot/1.07

Rule Path
Disallow /

linkpadbot

Rule Path
Disallow /

lightspeedsystemscrawler

Rule Path
Disallow /

netseer

Rule Path
Disallow /

netseer crawler/2.0

Rule Path
Disallow /

nutch

Rule Path
Disallow /

ncbot

Rule Path
Disallow /

mauibot

Rule Path
Disallow /

megaindex.ru/2.0

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

megaindex

Rule Path
Disallow /

psbot

Rule Path
Disallow /

reap-crawler

Rule Path
Disallow /

twiceler

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

gptbot/1.0

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

spbot/5.0.3

Rule Path
Disallow /

spbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

siteexplorer

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

meteorarobot

Rule Path
Disallow /
Disallow /

myonid

Rule Path
Disallow /

eventax/1.0

Rule Path
Disallow /

gigabot/1.0

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

getintentcrawler getintent.com

Rule Path
Disallow /

gluten free crawler/1.0

Rule Path
Disallow /

gluten free crawler

Rule Path
Disallow /

heritrix

Rule Path
Disallow /

ichiro/3.0

Rule Path
Disallow /

ichiro

Rule Path
Disallow /

jobboersebot
jooblebot/2.0

Rule Path
Disallow /

jakarta commons-httpclient

Rule Path
Disallow /

ltx71

Rule Path
Disallow /

linguee

Rule Path
Disallow /

paqlebot

Rule Path
Disallow /

paqlebot/2.0

Rule Path
Disallow /

pixraybot

Rule Path
Disallow /

pipl

Rule Path
Disallow /

proximic

Rule Path
Disallow /

pubmatic

Rule Path
Disallow /

quantcastbot/1.0

Rule Path
Disallow /

re-re studio

Rule Path
Disallow /

riddler

Rule Path
Disallow /

scopia crawler 1.0

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

shopwiki

Rule Path
Disallow /

similarpages/nutch-1.0-dev

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

semager

Rule Path
Disallow /

seekport crawler

Rule Path
Disallow /

seekportbot

Rule Path
Disallow /

zoombot

Rule Path
Disallow /

linkbot 1.0

Rule Path
Disallow /

seoscanners.net/1

Rule Path
Disallow /

seobilitybot

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

seokicks

Rule Path
Disallow /

serpstatbot/1.0

Rule Path
Disallow /

smtbot

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

surveybot

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

infotiger crawler

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

uptimebot

Rule Path
Disallow /

voltron

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

vegi bot

Rule Path
Disallow /

www.wevika.de

Rule Path
Disallow /

netestate ne crawler

Rule Path
Disallow /

websitewiki

Rule Path
Disallow /

web-archive-net.com

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

xovibot

Rule Path
Disallow /

yacybot

Rule Path
Disallow /

yandex

Rule Path
Disallow /buehnen/
Disallow /museen/
Disallow /galerien/
Disallow /kuenstler/
Disallow /meine-kulturbox/
Disallow /wc.dll

Other Records

Field Value
crawl-delay 3600

yasni

Rule Path
Disallow /

yeti

Rule Path
Disallow /

y!j-asr/0.1

Rule Path
Disallow /

zoominfobot

Rule Path
Disallow /

*

Rule Path
Disallow /cgi-bin/
Disallow /scripts/
Disallow /tmp/
Disallow /wc.dll
Disallow /museen/wc.dll
Disallow /galerien/wc.dll
Disallow /buehnen/wc.dll
Disallow /kuenstler/wc.dll
Disallow /meine-kulturbox/

Comments

  • robots.txt for https://www.kulturbox.de
  • file created: 07.08.00
  • file changed: 23.09.2022
  • https://academicbot.rtu.lv; mailto:caps@rtu.lv
  • http://www.admantx.com/service-fetcher.html
  • http://ahrefs.com/robot/
  • https://developer.amazon.com/de/support/amazonbot
  • https://archive.org/details/archive.org_bot
  • https://www.auskunft.de
  • http://www.backlinktest.com/crawler.html
  • http://www.exensa.com/crawl
  • Barkrowler/0.7
  • BLEXBot/1.0; +http://webmeup-crawler.com/
  • http://law.di.unimi.it/BUbiNG.htm
  • BuiBui-Bot/1.0 (3m4il: buibui[dot]bot[à7]moquadv[dot]com)
  • DingoBot (http://search.subinsb.com/about/bot.php
  • http://napoveda.seznam.cz/en/seznambot-intro/) -
  • XaxisSemanticsClassifier/1.0 http://crystalsemantics.com
  • http://corpora.informatik.uni-leipzig.de/crawler_faq.html
  • 203.133.174.12 http://cs.daum.net/faq/15/4118.html?faqId=28966
  • eright/1.0; +bot@eright.com
  • eright/1.0; +bot@eright.com
  • TweetmemeBot/4.0; +http://datasift.com/bot.html
  • 185.20.6.86
  • TweetmemeBot/4.0; +http://datasift.com/bot.html
  • http://domainreanimator.com
  • 185.6.8.3 info@domaincrawler.com; http://www.domaincrawler.com/
  • https://domainstats.com/pages/our-bot
  • https://dataforseo.com/dataforseo-bot
  • https://imagesift.com/about
  • electricmonk/3.2.0
  • +https://www.duedil.com/ourcrawler/
  • mindUpBot (datenbutler.de)
  • http://www.exabot.com/go/robot
  • https://extlinks.com/Bot.html
  • https://extlinks.com/Bot.html
  • http://garlik.com/, crawler@garlik.com
  • +http://www.profound.net/domainappender
  • HybridBot (hybrid.ru/about. Contact email: m.lyashkov@targetix.net)
  • http://corpora.informatik.uni-leipzig.de/crawler_faq.html
  • https://nlp.fi.muni.cz/projects/biwec/
  • (compatible; SpiderLing (a SPIDER for LINGustic research); +http://nlp.fi.muni.cz/projects/biwec/)
  • +LinkpadBot/1.07;++http://www.linkpad.ru
  • compatible; LinkpadBot/1.12; +http://www.linkpad.ru
  • MauiBot (crawler.feedback+wc@gmail.com)
  • 54.165.24.122
  • +MegaIndex.ru/2.0;++http://megaindex.com/crawler
  • https://openai.com/gptbot
  • https://openai.com/gptbot
  • http://OpenLinkProfiler.org/bot
  • http://OpenLinkProfiler.org/bot
  • http://www.opensiteexplorer.org/dotbot,+help@moz.com
  • 216.244.66.198
  • http://siteexplorer.info/Backlink-Checker-Spider/
  • https://metrics-tools.de/robot.html
  • Gluten Free Crawler/1.0; http://glutenfreepleasure.com/
  • Gluten Free Crawler/1.0; http://glutenfreepleasure.com/
  • http://www.jobboerse.com/bot.htm
  • https://jooble.org/jooble-bot
  • ltx71 - (http://ltx71.com/)
  • (compatible; Paqlebot/2.0; +http://www.paqle.dk/about/paqlebot).
  • http://www.proximic.com/info/spider.php
  • Crawler Bot
  • Quantcastbot/1.0 (+http://www.quantcast.com/bot)
  • http://re-re.ru/
  • http://www.scopia.co
  • SemrushBot/1.2~bl; +http://www.semrush.com/bot.html
  • http://seekport.com/
  • http://seekport.com/
  • http://suite.seozoom.it/bot.html
  • seoscanners.net/1; +spider@seoscanners.net
  • https://www.seobility.net/sites/bot.html
  • https://www.seokicks.de/robot.html
  • https://www.seokicks.de/robot.html
  • serpstatbot/1.0 (advanced backlink tracking bot; http://serpstatbot.com/;
  • http://www.sogou.com/docs/help/webmasters.htm#07
  • http://www.searchmetrics.com/de/searchmetricsbot/
  • http://www.infotiger.com/search
  • https://turnitin.com/robot/crawlerinfo.html
  • https://uptime.com/uptimebot
  • engineering@velen.io
  • http://www.website-datenbank.de
  • web-archive-net.com/1.1; +http://web-archive-net.com/bot
  • http://webmeup-crawler.com
  • Wotbox/2.01 (+http://www.wotbox.com/bot/)
  • http://yacy.net/bot.html
  • http://yandex.com/bots
  • http://www.yahoo-help.jp/app/answers/detail/p/595/a_id/42716/
  • ZoominfoBot (zoominfobot at zoominfo dot com)
  • Bei Kulturbox.de finden Sie neben der Kultursuchmaschine einen deutschlandweiten Veranstaltungskalender und # Hintergrundinformationen zu Theater, Museen, Musical, Konzerte und Galerien; Sitzpläne und Preisgruppen
  • Veranstaltungskalender, Kultursuchmaschine, Theater, Museen, Opern, Konzerte, Veranstaltungsprogramm, Repertoire, # Veranstaltungshinweise, Kultur, Komponisten, Künstler, Bühne, Musical, Galerien, Kulturinformationen, # Veranstaltungstermine, # Veranstaltungsorte, Berlin, Hamburg, München, Stuttgart, Leibzig, Dresden, Spielplan, Repertoire
  • exclude robots from specified tree

Warnings

  • 7 invalid lines.
  • `user agent` is not a known field.