ls-mapping-team.de
robots.txt

Robots Exclusion Standard data for ls-mapping-team.de

Resource Scan

Scan Details

Site Domain ls-mapping-team.de
Base Domain ls-mapping-team.de
Scan Status Ok
Last Scan2025-10-02T11:18:10+00:00
Next Scan 2025-10-09T11:18:10+00:00

Last Scan

Scanned2025-10-02T11:18:10+00:00
URL https://ls-mapping-team.de/robots.txt
Domain IPs 2a01:4f9:2b:284e::2, 95.216.246.33
Response IP 95.216.246.33
Found Yes
Hash 10c3df595ad007a0ab325079ef6c45c91c51f46be46978a26ad55976a097b36b
SimHash 30d403108776

Groups

*

Rule Path
Disallow /acp/
Disallow /log/
Disallow /tests/
Disallow /sys/
Disallow /statistics-wbb/
Disallow /abc/

googlebot

Rule Path
Allow *.css
Allow *.js

obot

Rule Path
Disallow /

baiduspider
baiduspider
baiduspider+
baiduspider-video
baiduspider-image

Rule Path
Disallow /

megaindex.ru
megaindex.ru/
megaindex.ru/2.0
mozilla/5.0 (compatible; megaindex.ru/2.0; +https://www.megaindex.ru/?tab=linkanalyze)
mj12
mozilla/5.0 (compatible; mj12bot/v1.4.5; http://www.majestic12.co.uk/bot.php?+)
wotbox
ltx71 - (http://ltx71.com/)
scoutjet
mozilla/5.0 (compatible; blekkobot; scoutjet; +http://blekko.com/about/blekkobot)
springbot
shopspring springbot

Rule Path
Disallow /

webreaper
webcopier
offline explorer
httrack
microsoft.url.control
emailcollector
penthesilea
http://anonymouse.org/ (unix)
admantx
hybridbot

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

sistrix crawler

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

jobs.de-robot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

unisterbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

surveybot

Rule Path
Disallow /

seodiver

Rule Path
Disallow /

spbot

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

meanpathbot

Rule Path
Disallow /

backlinkcrawler

Rule Path
Disallow /

magpie-crawler

Rule Path
Disallow /

obot

Rule Path
Disallow /

fr-crawler

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

megaindex.com

Rule Path
Disallow /

cloudservermarketspider

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

careerbot

Rule Path
Disallow /

lipperhey-kaus-australis

Rule Path
Disallow /

seoscanners.net

Rule Path
Disallow /

metajobbot

Rule Path
Disallow /

spiderbot

Rule Path
Disallow /

linkstats

Rule Path
Disallow /

jobboersebot

Rule Path
Disallow /

iccrawler

Rule Path
Disallow /

plista

Rule Path
Disallow /

domain re-animator bot

Rule Path
Disallow /

lipperhey-kaus-australis

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

coccoc

Rule Path
Disallow /

um-ic

Rule Path
Disallow /

mindupbot

Rule Path
Disallow /

sg-orbiter

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

qwantify

Rule Path
Disallow /

kraken

Rule Path
Disallow /

plukkie

Rule Path
Disallow /

safednsbot

Rule Path
Disallow /

haosouspider

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

openhosebot

Rule Path
Disallow /

screaming frog seo spider

Rule Path
Disallow /

thumbsniper

Rule Path
Disallow /

r6_commentreader

Rule Path
Disallow /

implisensebot

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

aihitbot

Rule Path
Disallow /

trendictionbot

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

mozilla/5.0 (compatible; yandeximages/3.0; +http://yandex.com/bots)

Rule Path
Disallow /

cliqzbot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

slurp

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

seoscanners.net/1

Rule Path
Disallow /

seoscanners.net

Rule Path
Disallow /

seoscanners

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

mozilla/5.0 (compatible; mj12bot/v1.4.4; http://www.majestic12.co.uk/bot.php?+)

Rule Path
Disallow /

mediatoolkitbot

Rule Path
Disallow /

ai2bot
bot\*
ai2bot-dolma
aihitbot
amazonbot
anthropic-ai
applebot
applebot-extended
brightbot 1.0
bytespider
ccbot
chatgpt-user
claude-searchbot
claude-user
claude-web
claudebot
cohere-ai
cohere-training-data-crawler
cotoyogi
crawlspace
diffbot
duckassistbot
facebookbot
facebook
factset_spyderbot
firecrawlagent
friendlycrawler
google-cloudvertexbot
google-extended
googleother
googleother-image
googleother-video
gptbot
iaskspider/2.0
icc-crawler
imagesiftbot
img2dataset
imgproxy
isscyberriskcrawler
kangaroo bot
meta-externalagent
meta-externalagent
meta-externalfetcher
meta-externalfetcher
mistralai-user/1.0
novaact
oai-searchbot
omgili
omgilibot
operator
pangubot
perplexity-user
perplexitybot
petalbot
qualifiedbot
scrapy
semrushbot-ocob
semrushbot-swa
sidetrade indexer bot
tiktokspider
timpibot
velenpublicwebcrawler
webzio-extended
wpbot
youbot

Rule Path
Disallow /

Comments

  • ===================================
  • Folgende Seiten sollen nicht indexiert werden:
  • ===================================
  • Googlebot
  • ===================================
  • Schlie�e folgende Spider komplett aus:
  • ===================================
  • Google Image Crawler Setup
  • User-agent: Googlebot-Image
  • Disallow:
  • Baidu Crawler Setup
  • Unwanted Crawlers Setup
  • Disallow: Sistrix
  • Disallow: Sistrix
  • Disallow: Sistrix
  • Disallow: SEOkicks-Robot
  • Disallow: jobs.de-Robot
  • Backlink Analysis
  • Bot der Leipziger Unister Holding GmbH
  • http://moz.com/products
  • http://www.searchmetrics.com
  • http://www.majestic12.co.uk/projects/dsearch/mj12bot.php
  • http://www.domaintools.com/webmasters/surveybot.php
  • http://www.seodiver.com/bot
  • http://openlinkprofiler.org/bot
  • http://www.wotbox.com/bot/
  • http://www.opensiteexplorer.org/dotbot
  • http://moz.com/researchtools/ose/dotbot
  • http://www.meanpath.com/meanpathbot.html
  • http://www.backlinktest.com/crawler.html
  • http://www.brandwatch.com/magpie-crawler/
  • http://filterdb.iss.net/crawler/
  • http://webmeup-crawler.com
  • https://megaindex.com/crawler
  • http://www.cloudservermarket.com
  • http://www.trendiction.de/de/publisher/bot
  • http://www.exalead.com
  • http://www.career-x.de/bot.html
  • https://www.lipperhey.com/en/about/
  • https://www.lipperhey.com/en/about/
  • https://turnitin.com/robot/crawlerinfo.html
  • http://help.coccoc.com/
  • ubermetrics-technologies.com
  • datenbutler.de
  • http://searchgears.de/uber-uns/crawling-faq.html
  • http://commoncrawl.org/faq/
  • https://www.qwant.com/
  • http://linkfluence.net/
  • http://www.botje.com/plukkie.htm
  • https://www.safedns.com/searchbot
  • http://www.haosou.com/help/help_3_2.html
  • http://www.haosou.com/help/help_3_2.html
  • http://www.moz.com/dp/rogerbot
  • http://www.openhose.org/bot.html
  • http://www.screamingfrog.co.uk/seo-spider/
  • http://thumbsniper.com
  • http://www.radian6.com/crawler
  • http://cliqz.com/company/cliqzbot
  • https://www.aihitdata.com/about
  • http://www.trendiction.com/en/publisher/bot
  • http://warebay.com/bot.html
  • new after here

Warnings

  • 3 invalid lines.