linux-magazin.de
robots.txt
Robots Exclusion Standard data for linux-magazin.de
Resource Scan
Scan Details
Site Domain | linux-magazin.de |
Base Domain | linux-magazin.de |
Scan Status | Ok |
Last Scan | 2024-11-13T19:41:07+00:00 |
Next Scan | 2024-11-20T19:41:07+00:00 |
Last Scan
Scanned | 2024-11-13T19:41:07+00:00 |
URL | https://linux-magazin.de/robots.txt |
Redirect | https://www.linux-magazin.de/robots.txt |
Redirect Domain | www.linux-magazin.de |
Redirect Base | linux-magazin.de |
Domain IPs | 104.26.6.95, 104.26.7.95, 172.67.73.98, 2606:4700:20::681a:65f, 2606:4700:20::681a:75f, 2606:4700:20::ac43:4962 |
Redirect IPs | 104.26.6.95, 104.26.7.95, 172.67.73.98, 2606:4700:20::681a:65f, 2606:4700:20::681a:75f, 2606:4700:20::ac43:4962 |
Response IP | 104.26.6.95 |
Found | Yes |
Hash | d28f91fd3646c7afd0a2b7152a6058129912c45ec718509f2be79817a3d89d2c |
SimHash | 779f5092c383 |
Groups
ccbot
acontbot
amznkassocbot
aboutusbot
acoon-robot
advista
aqua_products
backdoorbot/1.0
baiduspider
blogpulselive
blowfish/1.0
bookmark search tool
botalot
builtbottough
bullseye/1.0
bunnyslippers
cfnetwork
cabot
cazoodlebot
charlotte
cheesebot
cherrypicker
cherrypickerelite/1.0
cherrypickerse/1.0
cizilla
copyrightcheck
crescent
crescent internet toolpak http ole control v.1.0
crystalsemanticsbot
custo
dittospyder
djangotraineebot
download ninja
emailcollector
emailsiphon
emailwolf
erocrawler
euripbot
eurobot
exdomain
exabot
extractorpro
fairad client
fairshare
fetch
flaming attackbot
foobot
gaisbot
galbot
getright/4.2
gigabot
httrack
harvest/1.5
heinrichdermiragorobot
ia_archiver
ia_archiver/1.6
irlbot
infonavirobot
iron33/1.0.2
jennybot
jobroboter
kenjin spider
keyword density/0.9
lnspiderguy
lexibot
linguee
linkscan/8.1a unix
linkwalker
linkextractorpro
miixpc
miixpc/4.2
mj12bot
msiecrawler
magpierss
mail.ru
mata hari
meltwater
microsoft url control
microsoft url control - 5.01.4511
microsoft url control - 6.00.8169
microsoft.url.control
mister pix
mozilla/4.0 (compatible; bullseye; windows 95)
nicerspro
nebullabot
nerdbynature.bot
netants
netmechanic
netluchs
nutch
ocelli
offline explorer
openbot
openfind
openfind data gathere
oracle ultra search
pagesinventory
perman
peterbot
plukkie
propowerbot/2.14
prowebwalker
python-urllib
queryn metasearch
rma
radiation retriever 1.1
repomonkey
repomonkey bait & tackle/v1.01
ruky-bot
seokicks-robot
scoutjet
sensis web crawler
sitesnagger
snapbot
snoopy
sosospider
spankbot
speedy
surveybot
surveybot_ignoreip
szukacz/1.4
tasapspider
teleport
teleportpro
telesoft
tencenttraveler
the intraformant
thenomad
touche
true_robot
true_robot/1.0
url control
url_spider_pro
urly warning
vci
vci webviewer vci webviewer win32
www-collector-e
web image collector
webauto
webbandit
webbandit/3.50
webcopier
webenhancer
webreaper
websauger
webstripper
webzip
webzip
webzip/4.0
webmastercoffee
webmasterworldforumbot
website quester
webster pro
wget
wget/1.5.3
wget/1.6
xenu
xenu's
xenu's link sleuth 1.1c
yandex bot
youdaobot
zealbot
zeus
zeus 32297 webster pro v2.9 win32
zeus link scout
zyborg
adnbot
asterias
b2w/0.1
cityreview
cosmos
crawly
discobot
dotbot
dotbot
echobot
envolk
eventax
findlinks
flatlandbot
fyberspider
gonzo*
grub
grub-client
heise-it-markt-crawler
hloader
httplib
humanlinks
iccrawler
ichiro
iearthworm
infometrics-bot
jobs.de-robot
kalooga
larbin
laycat
libweb/clshttp
libwww
linko
looksmart
lwp-request
lwp-trivial
lwp-trivial/1.34
moget
moget/2.1
naughtyrobot
psbot
search17
searchlink
searchpreview
semager
sitecheck.internetseer.com
spanner
stalker
suggybot
suzuran
thesubot
tocrawl/urldispatcher
trendiction
turingos
uipbot
uipbot/1.0 (uipbot@semasio.net)
wget
woriobot
heritrix
trendictionbot
ahrefsbot
sentibot
landau-media-spider
Rule | Path |
---|---|
Disallow | / |
*
Rule | Path |
---|---|
Disallow | /wp-admin/ |
Allow | /wp-admin/admin-ajax.php |
Disallow | /wp-content/plugins/ctec_mods/ausgaben-listing.php |
Disallow | /datenschutz/ |
Disallow | /common/ |
Other Records
Field | Value |
---|---|
sitemap | https://www.linux-magazin.de/sitemap_index.xml |
Warnings
- 2 invalid lines.