lamarinaplaza.com
robots.txt

Robots Exclusion Standard data for lamarinaplaza.com

Resource Scan

Scan Details

Site Domain lamarinaplaza.com
Base Domain lamarinaplaza.com
Scan Status Ok
Last Scan2024-06-12T20:30:22+00:00
Next Scan 2024-06-19T20:30:22+00:00

Last Scan

Scanned2024-06-12T20:30:22+00:00
URL https://lamarinaplaza.com/robots.txt
Domain IPs 165.227.148.175
Response IP 165.227.148.175
Found Yes
Hash 02164bb6570b0e5fc111f6d43c66bb6b1ffc9a163ebab2d685340bc65b56fda1
SimHash e35b51538182

Groups

*

Rule Path
Disallow /agenda/action~posterboard/*
Allow /wp-content/uploads/*
Allow /wp-content/*.js
Allow /wp-content/*.css
Allow /wp-includes/*.js
Allow /wp-includes/*.css
Disallow /cgi-bin
Disallow /wp-content/plugins/
Disallow /wp-content/themes/
Disallow /wp-includes/
Disallow /*/attachment/
Disallow /tag/*/page/
Disallow /tag/*/feed/
Disallow /page/
Disallow /comments/
Disallow /xmlrpc.php
Disallow /?attachment_id*
Disallow /tag/
Disallow /agenda-cultural/action~posterboard/
Disallow /agenda-cultural/action~agenda/*
Disallow /agenda-cultural/action~oneday/*
Disallow /agenda-cultural/action~month/*
Disallow /agenda-cultural/action~week/*
Disallow /agenda-cultural/action~stream/*
Disallow /agenda/action~undefined/
Disallow /agenda/action~http%3A/
Disallow /agenda/action~default/
Disallow /agenda/action~poster/
Disallow /agenda/action~*/
Disallow /agenda/action~posterboard/*
Disallow /agenda/action~agenda/*
Disallow /agenda/action~oneday/*
Disallow /agenda/action~month/*
Disallow /agenda/action~week/*
Disallow /agenda/action~stream/*
Disallow /calendar/action~posterboard/*
Disallow /calendar/action~agenda/*
Disallow /calendar/action~oneday/*
Disallow /calendar/action~month/*
Disallow /calendar/action~week/*
Disallow /calendar/action~stream/*

twitterbot

Rule Path
Allow /*/attachment/
Disallow /*?
Disallow /?s=
Disallow /search

googlebot

Rule Path
Allow /*.css$
Allow /*.js$
Allow *
Disallow /agenda-cultural/action~undefined/
Disallow /agenda-cultural/action~http%3A/
Disallow /agenda-cultural/action~default/
Disallow /agenda-cultural/action~poster/
Disallow /agenda-cultural/action~*/
Disallow /*controller%3Dai1ec_exporter_controller*
Disallow /*/action~*/
Disallow /tag/

noxtrumbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 20

slurp
abachobot
aboundexbot
acapbot
acunetix
admantx
adnormcrawler
ahrefsbot
anarchie
antibot
appie
aspseek
asterias
attach
attribot
autoemailspider
awooo
b2w
backdoorbot
backweb
baiduspider-image
bandit
batchftp
black hole
blackwidow
blexbot
blowfish
araturka
bot\ mailto
botalot
bubing
buddy
buibui-bot
builtbottough
bullseye
bumblebee
bunnyslippers
butterfly
buzzsumo
ccbot
cegbfeieh
checks.panopta.com
cheesebot
cherrypicker
cherrypickerelite
cherrypickerse
chinaclaw
cis455crawler
clariabot
claritybot
claritydailybot
cliqzbot
clickagy intelligence bot
clshttp
cms crawler
coast
coldfusion
collector
commoncrawler
commoncrawler node
copier
copyrightcheck
cosmos
crawler4j
crazywebcrawler
crazywebcrawler-spider
crescent
curl
custo
da
dataparksearch
deepcrawl
deusu
dialogsearch
diamond
diffbot
digincore
disco
dittospyder
dj-research
dloader
doc
domain re-animator bot
domainappender
domainmacrocrawler
domainsigmacrawler
dotbot
download
downloader
drip
dts\ agent
easydl
ecatch
eirgrabber
emailcollector
emailsiphon
emailwolf
erocrawler
exabot
experibot
express\ webpictures
extractorpro
extreme\ picture\ finder
eyenetie
ezooms robot
fast\ webcrawler
favorg
fetch
fetch\ api\ request
filehound
finbot
findxbot
flashget
flickbot
flipboardproxy
foobot
freefind
friendica
frontpage
generic
getintentcrawler
getintent crawler
getproxi
getright
getsmart
getweb!
gigablastopensource
gigabot
gluten free crawler
go-ahead-got-it
go!zilla
gocrawl
gotit
grabber
grabnet
grafula
grapeshot
grapeshotcrawler
grub-client
gsa-crawler
gulliver
guzzlehttp
haosouspider
harvest
hatena star
heretrix
hitboxdoctor
hosttracker/2.0
hosttracker
hloader
hmview
httpapp
httpfetcher
httplib
httpscraper
httptrack
httpviewer
httrack
humanlinks
ia_archiver
ias_crawler
image\ stripper
image\ sucker
inagist url resolver
indy\ library
infonavirobot
insitesbot
interget
internet\ ninja
internetseer
iria
irlbot
iskanie
istellabot
jack
james bot
java
jennybot
jersey
jetcar
jobo
joc\ web\ spider
jonzilla
js-kit url resolver
justview
k2spider
kazbtbot
kenjin\ spider
keyword\ density
komodiabot
lachesis
larbin
leechftp
lexibot
lftp
libby_
libweb
libwww-perl
libwwwperl
likse
link
linkdexbot
linkdex.com
linkextractorpro
linko
linkpadbot
linkscan
linkwalker
litefinder
livelapbot
lnspiderguy
ls session
lssrocketcrawler
ltx71
lwp-trivial
lwp\ request
madaali
mag-net
magnet
mass\ downloader
mata\ hari
mbot
meanpathbot
megaindex
memo
memorybot
mercator
metacarta
metauri
mewsoft\ search\ engine
mfc_tear_sample
microsoft\ url\ control
microsofturl
midown\ tool
miixpc
mirror
missigua
mister\ pix
mj12bot
moget
moreover
msfrontpage
msiecrawler
nationaldirectory\ webspider
nativehost
navroad
nearsite
nerdybot
net\ probe
net\ vampire
netants
netestate ne crawler
netmechanic
netresearchserver
netseer crawler
netshelter contentscan
netspider
netzip
newsme
nexuscache
nicerspro
niki-bot
nikto
ning
ninja
node/simplecrawler
npbot
nutch
obot
octopus
offline explorer
offline\ explorer
offline\ navigator
onestop
openfind
openfind\ data\ gatherer
openhosebot
orangebot
orgprobe
orthogaffe
our\ agent
pad-bot
pagegrabber
panopta
panoptastudybot
panscient
papa\ foto
paperlibot
pavuk
pcbrowser
peerindex
perl
perl lwp
photon
php
php\ version
phpot
ping
pingalink\ monitoring\ services
piplbot
plukkie
pockey
pompos
postano
privacyawarebot
propowerbot
prowebwalker
proximic
psbot
psycheclone
pulsecrawler
pulsepoint xt3 web scraper
pump
python-urllib
python\ urllib
queryn
queryseekerspider
quipu
raven
ravencrawler
rbot
realdownload
reaper
recorded future
recorder
reget
repomonkey
rico
riddler
rma
roboto
robots
robozilla
rogerbot
ruby
rukicrawler
safednsbot
salesintelligent
scooter
scoutabout
scrapy
screaming frog seo spider
searchie
semantic-visions
semanticbot
semrushbot
semrushbot-sa
seokicks-robot
seolyticscrawler
seznambot
showyoubot
simplecrawler
siphon
sistrix crawler
sitecheck
siteluxbot
sitesnagger
slysearch
slurp
smartdownload
smeshbot
smtbot
snake
snapbot
snoopy
softlistbot
sogou spider
sogou web spider
sogou web spider/4.0
sogou
spacebison
spankbot
spanner
spbot
spiderbot
spider - panopta
spinne
squirrly
sqworm
srmse/nutch
ssearch_bot
stealer
stratagems kumo
stripper
sucker
superbot
superhttp
surdotlybot
surfbot
suzuran
szukacz
takeout
tbot-nutch
telegrambot
teleport
teleportpro
telesoft
the\ intraformant
thenomad
tighttwatbot
titan
tocrawl
true_robot
turingos
turnitinbot
twmbot
typhoeus
ubicrawler
umbot-ln
uptimebot
uptimerobot/2.0
uptimerobot
urldispatcher
urly\ warning
vacuum
vagabondo
vayala
vci
vebot
vericitecrawler
vintage
voideye
voltron
w3c_validator
wbsearchbot
web\ downloader
web\ image\ collector
web\ sucker
webauto
webbandit
webcopier
webdownloader
webenhancer
webfetch
webgo
webhook
webindex
webleacher
webmasterworldforumbot
webminer
webmirror
webmole
webreaper
websauger
website
website\ extractor
website\ quester
websites
webster
webster\ pro
webstripper
websucker
webviewer
webvulncrawl
webvulnscan
webwhacker
webzip
wells
wesee_bot
wget
whacker
widow
wildsoft
winhttp
winhttprequest
wiseguys robot
woobot
woriobot
wotbox
www-collector-e
wwwoffle
x-cad-se
xaldon
xara
xenu
xovibot
y!tunnelpro
yacybot
yioopbot
zade
zao
zbot
zealbot
zeus
zumbot
zyborg

Rule Path
Disallow /

amazon-kendra-web-crawler-*

Product Comment
amazon-kendra-web-crawler-* all customers of Amazon Kendra's web crawler
Rule Path Comment
Disallow / disallow everything

googlebot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

bingbot

No rules defined. All paths allowed.

Other Records

Field Value
crawl-delay 10

Comments

  • Bloqueo basico para todos los bots y crawlers
  • puede dar problemas por bloqueo de recursos en GWT
  • Bloqueo de busquedas en agenda
  • Bloqueo de las URL dinamicas
  • Bloqueo de busquedas
  • Previene problemas de recursos bloqueados en Google Webmaster Tools
  • Ralentizamos algunos bots que se suelen volver locos
  • Bloqueo de bots y crawlers poco utiles

Warnings

  • 3 invalid lines.