karlstadt.de
robots.txt

Robots Exclusion Standard data for karlstadt.de

Resource Scan

Scan Details

Site Domain karlstadt.de
Base Domain karlstadt.de
Scan Status Ok
Last Scan2024-06-11T23:07:41+00:00
Next Scan 2024-07-11T23:07:41+00:00

Last Scan

Scanned2024-06-11T23:07:41+00:00
URL https://karlstadt.de/robots.txt
Domain IPs 51.116.237.107
Response IP 51.116.237.107
Found Yes
Hash ff0305f9f9fd065372145f9e26f5aa1acaceac4563568cd8a313e5f3510ad98d
SimHash 64d65080c6a7

Groups

*

Rule Path
Disallow /calendar/icalendar.asp
Disallow /scripts/vcard.asp

googlebot

Rule Path
Disallow /direkt.asp

bingbot
adidxbot
bingpreview
msnbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 8

ahrefsbot
claudebot
baiduspider
blexbot
blogpulselive
crawly
dotbot
download ninja
echobot
envolk
euripbot
eurobot
exabot
fetch
flatlandbot
fyberspider
gigabot
gonzo*
grub-client
heise-it-markt-crawler
httrack
iccrawler
iearthworm
infometrics-bot
jakarta commons-httpclient
jobroboter
jobs.de-robot
kalooga
larbin
laycat
libwww
linko
lcc
mail.ru
microsoft.url.control
mj12bot
msiecrawler
nebullabot
netestate ne crawler
netestate foaf crawler
netestate rss crawler
netluchs
ocelli
offline explorer
pixray-seeker
psbot
ruky-bot
scoutjet
searchlink
semager
shopwiki
sitecheck.internetseer.com
sitesnagger
snapbot
sosospider
speedy
surveybot
surveybot_ignoreip
tasapspider
teleport
teleportpro
touche
twiceler
webcopier
webmeasurement-bot
webreaper
website-datenbank.de
websitewiki
webstripper
webzip
wget
woriobot
xenu
yandex
youdaobot
zealbot
zyborg

Rule Path
Disallow /

Comments

  • ---------------------------------------------
  • Einzelne Dateien ausschliessen...
  • ---------------------------------------------
  • ---------------------------------------------
  • Website nur alle X Sekunden besuchen...
  • ---------------------------------------------
  • ---------------------------------------------
  • Bots ausschliessen, die nur Traffic machen...
  • ---------------------------------------------