abitare-living.lu
robots.txt

Robots Exclusion Standard data for abitare-living.lu

Resource Scan

Scan Details

Site Domain abitare-living.lu
Base Domain abitare-living.lu
Scan Status Ok
Last Scan 2024-11-10T12:34:31+00:00
Next Scan 2024-12-10T12:34:31+00:00

Last Scan

Scanned 2024-11-10T12:34:31+00:00
URL https://abitare-living.lu/robots.txt
Domain IPs 51.254.81.226
Response IP 51.254.81.226
Found Yes
Hash 86107066b0875705de5b09ba48de95a6d71b27ad57be1192952804321829f63f
SimHash 7f0467714c96

Groups

*

Product Comment
* all robots
Rule Path Comment
Allow / are disallowed to crawl all pages
Disallow *?q= -
Disallow *?query= -
Disallow /cart/* -
Disallow /account/* -
Allow /robots.txt -
Allow /sitemap.xml -
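
The rules in this group can be read under the longest-match precedence described in RFC 9309: the most specific matching pattern decides, and an Allow wins a tie. The following is a minimal sketch of that evaluation, not the scanner's own logic. The helper names `pattern_to_regex` and `is_allowed` are hypothetical, and it assumes that patterns without a leading slash (`*?q=`, `*?query=`) are accepted as wildcard patterns, which not every parser does.

```python
import re

# Rules as listed in the "*" group of the scan, in the same order.
RULES = [
    ("allow", "/"),
    ("disallow", "*?q="),
    ("disallow", "*?query="),
    ("disallow", "/cart/*"),
    ("disallow", "/account/*"),
    ("allow", "/robots.txt"),
    ("allow", "/sitemap.xml"),
]


def pattern_to_regex(pattern):
    """Translate a robots.txt pattern: '*' matches any run of characters,
    a trailing '$' anchors the end of the path (hypothetical helper)."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(path, rules=RULES):
    """Longest matching pattern wins; ties go to Allow; no match means allowed."""
    best_len, verdict = -1, True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            allow = kind == "allow"
            if len(pattern) > best_len or (len(pattern) == best_len and allow):
                best_len, verdict = len(pattern), allow
    return verdict


for path in ["/products/sofa", "/search?q=sofa", "/cart/checkout", "/robots.txt"]:
    print(path, is_allowed(path))
```

Under these assumptions, `/search?q=sofa` and `/cart/checkout` are blocked because their Disallow patterns are longer than the catch-all `Allow /`, while ordinary pages stay crawlable.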

googlebot

Product Comment
googlebot except Googlebot
Rule Path Comment
Allow / can crawl all content

woorank

Rule Path
Disallow /

bingbot

Product Comment
bingbot except Bingbot
Rule Path Comment
Allow / can crawl all content

Other Records

Field Value
crawl-delay 20
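
A crawler that honors the non-standard crawl-delay field could read it with Python's standard `urllib.robotparser`. The sketch below assumes a reconstruction of the bingbot group from the table above. The `FRAGMENT` text is hypothetical, since the scan does not show the original file verbatim.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of the bingbot group; the exact text of the
# original robots.txt is an assumption.
FRAGMENT = """\
User-agent: bingbot
Allow: /
Crawl-delay: 20
"""

parser = RobotFileParser()
parser.parse(FRAGMENT.splitlines())

# Seconds a polite client would wait between requests for this user agent.
delay = parser.crawl_delay("bingbot")
allowed = parser.can_fetch("bingbot", "https://abitare-living.lu/")
print(delay, allowed)
```

Note that `urllib.robotparser` applies the first matching rule in a group rather than the longest match, so it is suited to simple groups like this one but may disagree with other parsers on wildcard rules.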

mj12bot

Product Comment
mj12bot block MJ12bot
Rule Path
Disallow /

dotbot

Product Comment
dotbot block DotBot
Rule Path
Disallow /

zoominfobot

Product Comment
zoominfobot block ZoominfoBot
Rule Path
Disallow /

seokicks

Product Comment
seokicks block SEOkicks
Rule Path
Disallow /

semrushbot

Product Comment
semrushbot block SemrushBot
Rule Path
Disallow /

*

Product Comment
* all robots
Rule Path Comment
Allow / are disallowed to crawl all pages
Disallow *?q= -
Disallow *?query= -
Disallow /cart/* -
Disallow /account/* -
Allow /robots.txt -
Allow /sitemap.xml -

googlebot

Product Comment
googlebot except Googlebot
Rule Path Comment
Allow / can crawl all content

woorank

Rule Path
Disallow /

bingbot

Product Comment
bingbot except Bingbot
Rule Path Comment
Allow / can crawl all content

Other Records

Field Value
crawl-delay 20

mj12bot

Product Comment
mj12bot block MJ12bot
Rule Path
Disallow /

dotbot

Product Comment
dotbot block DotBot
Rule Path
Disallow /

zoominfobot

Product Comment
zoominfobot block ZoominfoBot
Rule Path
Disallow /

seokicks

Product Comment
seokicks block SEOkicks
Rule Path
Disallow /

semrushbot

Product Comment
semrushbot block SemrushBot
Rule Path
Disallow /

yandexbot

Product Comment
yandexbot block YandexBot
Rule Path
Disallow /

mediapartners
bingbot
slurp
linkedinbot
python-urllib
python-requests
libwww-perl
httpunit
nutch
go-http-client
phpcrawl
msnbot
jyxobot
fast-webcrawler
fast enterprise crawler
biglotron
teoma
convera
seekbot
gigabot
gigablast
exabot
ia_archiver
gingercrawler
webmon
httrack
grub.org
usinenouvellecrawler
antibot
netresearchserver
speedy
fluffy
findlink
msrbot
panscient
yacybot
aisearchbot
ips-agent
tagoobot
mj12bot
woriobot
yanga
buzzbot
mlbot
yandexbot
yandeximages
yandexaccessibilitybot
yandexmobilebot
purebot
linguee bot
cyberpatrol
voilabot
baiduspider
citeseerxbot
spbot
twengabot
postrank
turnitinbot
scribdbot
page2rss
sitebot
linkdex
adidxbot
ezooms
dotbot
mail.ru_bot
discobot
heritrix
findthatfile
europarchive.org
nerdbynature.bot
sistrix crawler
fuelbot
crunchbot
indeedbot
mappydata
woobot
zoominfobot
privacyawarebot
multiviewbot
swimgbot
grobbot
eright
apercite
semanticbot
aboundex
domaincrawler
wbsearchbot
summify
ccbot
edisterbot
seznambot
ec2linkfinder
gslfbot
aihitbot
intelium_bot
facebookexternalhit
yeti
retrevopageanalyzer
lb-spider
sogou
lssbot
careerbot
wotbox
wocbot
ichiro
duckduckbot
lssrocketcrawler
drupact
webcompanycrawler
acoonbot
openindexspider
gnam gnam spider
web-archive-net.com.bot
backlinkcrawler
coccoc
integromedb
content crawler spider
toplistbot
it2media-domain-crawler
ip-web-crawler.com
siteexplorer.info
elisabot
proximic
changedetection
arabot
wesee:search
niki-bot
crystalsemanticsbot
rogerbot
psbot
interfaxscanbot
cc metadata scaper
g00g1e.net
grapeshotcrawler
urlappendbot
brainobot
fr-crawler
binlar
simplecrawler
twitterbot
cxensebot
smtbot
bnf.fr_bot
a6-indexer
admantx
facebot
orangebot
memorybot
advbot
megaindex
semanticscholarbot
ltx71
nerdybot
xovibot
bubing
qwantify
archive.org_bot
applebot
tweetmemebot
crawler4j
findxbot
semrushbot
yoozbot
lipperhey
y!j
domain re-animator bot
addthis
screaming frog seo spider
metauri
scrapy
livelap[bb]ot
openhosebot
capsulechecker
collection@infegy.com
istellabot
deusu
betabot
cliqzbot
mojeekbot
netestate ne crawler
safesearch microdata crawler
gluten free crawler
sonic
sysomos
trove
deadlinkchecker
slack-imgproxy
embedly
rankactivelinkbot
iskanie
safednsbot
skypeuripreview
veoozbot
slackbot
redditbot
datagnionbot
adbeat_bot
whatsapp
contxbot
pinterest.com.bot
electricmonk
garlikcrawler
bingpreview
vebidoobot
femtosearchbot
yahoo link preview
metajobbot
domainstatsbot
mindupbot
daum
jugendschutzprogramm-crawler
xenu link sleuth
pcore-http
moatbot
kosmiobot
pingdom
appinsights
phantomjs
gowikibot
piplbot
discordbot
telegrambot
jetslide
newsharecounts
james bot
bark[rr]owler
tineye
socialrankiobot
trendictionbot
ocarinabot
epicbot
primalbot
duckduckgo-favicons-bot
gnowitnewsbot
leikibot
linkarchiver
yak
paperlibot
digg deeper
dcrawl
snacktory
anderspinkbot
fyrebot
everyonesocialbot
mediatoolkitbot
luminator-robots
extlinksbot
surveybot
ning
okhttp
nuzzel
omgili
pocketparser
yisouspider
um-ln
toutiaospider
muckrack
jamie's spider
ahc
netcraftsurveyagent
laserlikebot
jetty
upflow
thinklab
traackr.com
twurly
mastodon
http_get
dnyzbot
botify
behloolbot
brandverity
check_http
bdcbot
zumbot
ezid
icc-crawler
archivebot
filterdb.iss.netcrawler
blp_bbot
bomborabot
buck
companybook-crawler
genieo
magpie-crawler
meltwaternews
moreover
newspaper
scoutjet
storygizebot
uptimerobot
outclicksbot
seoscanners
hatena
mauibot
alphabot
sbl-bot
ias crawler
adscanner
netvibes
acapbot
baidu-yunguance
bitlybot
blogmurabot
bot.araturka.com
bot-pge.chlooe.com
boxcarbot
btwebclient
contextad bot
digincore bot
disqus
feedly
fetch
fever
flamingo_searchengine
flipboardproxy
g2reader-bot
g2 web services
imrbot
k7mlwcbot
kemvibot
landau-media-spider
linkapediabot
vkshare
siteimprove.com
blexbot
dareboost
zuperlistbot
miniflux
feedspot
diffbot
seokicks
tracemyfile
nimbostratus-bot
zgrab
pr-cy.ru
adstxtcrawler
datafeedwatch
zabbix
tangibleebot
axios
amazon cloudfront
pulsepoint
cloudflare-alwaysonline
wordupinfosearch
webdatastats
httpurlconnection
seekport crawler
zoombot
velenpublicwebcrawler
moodlebot
jpg-newsbot
outbrain
w3c_validator
validator.nu
w3c-checklink
w3c-mobileok
w3c_i18n-checker
feedvalidator
w3c_css_validator
w3c_unicorn
blackboard
icbot
bazqux
twingly
rivva
experibot
awesomecrawler
dataprovider.com
grouphigh
theoldreader.com
anyevent
uptimebot.org
nmap scripting engine
clickagy
caliperbot
mbcrawler
online-webceo-bot
b2b bot
addsearchbot
hubspot
chrome-lighthouse
headlesschrome
checkmarknetwork
www.uptime.com
streamline3bot
serpstatbot
mixnodecache
simplescraper
rssingbot
jooblebot
fedoraplanet
friendica
nextcloud
tiny tiny rss
regionstuttgartbot
bytespider
datanyze
trendsmapresolver
tweetedtimes
ntentbot
gwene
simplepie
searchatlas
superfeedr
feedbot
ut-dorkbot
amazonbot
serendeputybot
eyeotabot
officestorebot
neticle crawler
surdotlybot
linkisbot
awariosmartbot
awariorssbot
rytebot
freewebmonitoring sitechecker
aspiegelbot
naver blog rssbot
zenback bot
sentibot
domains project
pandalytics
vkrobot
bidswitchbot
tigerbot
nixstatsbot
atom feed robot
curebot
pagepeeker
vigil
rssbot
startmebot
jobboersebot
seewithkids
ninja bot
cutbot
bublupbot
brandonbot
ridderbot
yandexmetrika
yandexturbo
yandeximageresizer
yandexvideoparser
taboolabot
dubbotbot
finditanswersbot
infoobot
refindbot
blogtrafficd.d+ feed-fetcher
seobilitybot
cincraw
dragonbot
voluumdsp-content-bot
freshrss
bitbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://abitare-living.lu/sitemap.xml
sitemap https://abitare-kids.lu/sitemap.xml
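
The sitemap records above apply to the file as a whole rather than to any one group. As a rough illustration, assuming the records appear as ordinary `Sitemap:` lines (the `FRAGMENT` text is a hypothetical reconstruction), `RobotFileParser.site_maps()` in Python 3.8 or later returns them in order:

```python
from urllib.robotparser import RobotFileParser

# Hypothetical reconstruction of the two sitemap records listed in the scan.
FRAGMENT = """\
Sitemap: https://abitare-living.lu/sitemap.xml
Sitemap: https://abitare-kids.lu/sitemap.xml
"""

parser = RobotFileParser()
parser.parse(FRAGMENT.splitlines())

# site_maps() returns a list of URLs, or None when no sitemap lines exist.
sitemaps = parser.site_maps()
print(sitemaps)
```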

Warnings

  • 8 invalid lines.