tiendalenovo.es
robots.txt

Robots Exclusion Standard data for tiendalenovo.es

Resource Scan

Scan Details

Site Domain tiendalenovo.es
Base Domain tiendalenovo.es
Scan Status Ok
Last Scan2024-09-24T03:50:57+00:00
Next Scan 2024-10-24T03:50:57+00:00

Last Scan

Scanned2024-09-24T03:50:57+00:00
URL https://tiendalenovo.es/robots.txt
Redirect https://www.tiendalenovo.es/robots.txt
Redirect Domain www.tiendalenovo.es
Redirect Base tiendalenovo.es
Domain IPs 51.254.5.65
Redirect IPs 51.254.5.65
Response IP 51.254.5.65
Found Yes
Hash 0693b01d0cff04ccf60df7877ba7fc9b9dbb848425948ad4955927b93b3640dd
SimHash 0317ea5242b1

Groups

*
googlebot

Rule Path
Allow /

adsbot-google

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

screaming frog seo spider

Rule Path
Allow /

slurp

Rule Path
Allow /

Other Records

Field Value
crawl-delay 20

noxtrumbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 50

msnbot

Rule Path
Allow /

Other Records

Field Value
crawl-delay 30

slurp

Rule Path
Allow /

Other Records

Field Value
crawl-delay 10

ahrefsbot

Rule Path
Allow /
Allow /pub/media/catalog/product/
Disallow /*?price_*
Disallow /index.php/
Disallow /*?
Disallow /checkout/
Disallow /vendor/
Disallow /app/
Disallow /*.php$
Disallow /pkginfo/
Disallow /report/
Disallow /var/
Disallow /catalog/
Disallow /customer/
Disallow /sendfriend/
Disallow /review/
Disallow /*SID%3D
Disallow /CVS
Disallow /*.svn$
Disallow /*.idea$
Disallow /*.sql$
Disallow /*.tgz$
Disallow /*.php$
Disallow /checkout/
Disallow /onestepcheckout/
Disallow /customer/
Disallow /customer/account/
Disallow /customer/account/login/
Disallow /customer/account/create/
Disallow /catalogsearch/
Disallow /catalog/product_compare/
Disallow /catalog/category/view/
Disallow /catalog/product/view/
Disallow /smartphones/smartphones-lenovo
Disallow /newsletter/
Disallow /poll/
Disallow /sendfriend/
Disallow /tag/
Disallow /wishlist/
Disallow /bin/
Disallow /dev/
Disallow /lib/
Disallow /phpserver/
Disallow /pub/
Disallow /*?dir*
Disallow /*?dir=desc
Disallow /*?dir=asc
Disallow /*?limit=all
Disallow /*?mode*
Disallow /composer.json
Disallow /composer.lock
Disallow /auth.json
Disallow /cron.php
Disallow /cron.sh
Disallow /error_log
Disallow /LICENSE.html
Disallow /LICENSE.txt
Disallow /LICENSE_AFL.txt
Disallow /STATUS.txt
Disallow /*.json$
Disallow /*.lock$
Disallow /*.conf$
Disallow /*.phar$
Disallow /*.sample$
Disallow /cookies
Disallow /privacidad
Disallow /contacto
Disallow /*/*/*/*/*
Disallow /*?*
Disallow /*?page=0
Disallow /*?page=1
Disallow /0
Disallow /amp/0

baiduspider
yandex
twiceler
gigabot
scrubby
robozilla
nutch
yeti
compspybot
curious george
cybeye.com
docomo
exb language crawler
ezooms
flamingo_searchengine
genieo
genio
lwnutch
lexxebot
openwebindex
rediffnewsbot
seoengworldbot
scanmine
screaming frog seo spider
shopwiki
showyoubot
sosospider
wocbot
yeti
youdaobot
daumoa
gsa-crawler
libcrawl
linkdex
magpie-crawler
repparser
rogerbot
sindice-site-manager
sogou spider
woriobot
yacybot
yolinkbot
baiduspider
yahoo! slurp china
yandexbot
yandex
archive.org_bot
semrushbot
yandeximages
baiduspider-image
msiecrawler
webcopier
python-urllib
url_spider_pro
emailcollector
emailsiphon
webbandit
emailwolf
extractorpro
copyrightcheck
alexibot
web image collector
xenu's link sleuth 1.1c
xenu's
zeus
zeus link scout
erocrawler
linkscan/8.1a unix
keyword density/0.9
webcopier v3.2a
webcapture 2.0
webcopier v.2.2
etaospider
black hole
xenu\\\'s link sleuth 1.1c
xenu\\\'s

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

duggmirror

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

httrack

Rule Path
Disallow /

microsoft.url.control

Rule Path
Disallow /

libwww

Rule Path
Disallow /

Comments

  • Bienvenido a el Robots de SUPPORT AND TECHNOLOGY :)
  • Google Image Crawler Setup
  • PRIVATE URLS
  • CORPORATIVE PAGES
  • STATICS
  • CRAWL BUDGET OPTIMIZATION
  • 3+ FILTERS
  • TECHNICAL RESOURCES
  • TECHNICAL ISSUES
  • Desautorizamos
  • Internet Archiver Wayback Machine
  • digg mirror

Warnings

  • 1 invalid line.