shopjura.com
robots.txt

Robots Exclusion Standard data for shopjura.com

Resource Scan

Scan Details

Site Domain shopjura.com
Base Domain shopjura.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-05-06T05:42:10+00:00
Next Scan 2024-08-04T05:42:10+00:00

Last Successful Scan

Scanned2023-01-23T18:56:03+00:00
URL https://shopjura.com/robots.txt
Domain IPs 192.124.249.19
Response IP 192.124.249.19
Found Yes
Hash 5330bfdbded3dbdc998f8e9f862e9e1b0c523eb80c72a51d947c95f01cf21353
SimHash 61b47b5545e1

Groups

*

Rule Path
Disallow /CVS
Disallow /*.svn$
Disallow /*.idea$
Disallow /*.sql$
Disallow /*.tgz$
Disallow /app/
Disallow /bin/
Disallow /dev/
Disallow /lib/
Disallow /phpserver/
Disallow /pkginfo/
Disallow /report/
Disallow /setup/
Disallow /update/
Disallow /var/
Disallow /vendor/
Disallow /api.php
Disallow /cron.php
Disallow /cron.sh
Disallow /error_log
Disallow /get.php
Disallow /install.php
Disallow /LICENSE.html
Disallow /LICENSE.txt
Disallow /LICENSE_AFL.txt
Disallow /LICENSE_EE.txt
Disallow /README.txt
Disallow /README_EE.txt
Disallow /RELEASE_NOTES.txt
Disallow /STATUS.txt
Disallow /magento-vars.php
Disallow /package.json
Disallow /php.ini.sample
Disallow /composer.json
Disallow /composer.lock
Disallow /auth.json
Disallow /Gruntfile.js
Disallow /phpunit.xml
Disallow /index.php/
Disallow /catalog/product_compare/
Disallow /catalog/category/view/
Disallow /catalog/product/view/
Disallow /catalogsearch/
Disallow /checkout/
Disallow /control/
Disallow /contacts/
Disallow /customer/
Disallow /customize/
Disallow /newsletter/
Disallow /review/
Disallow /sendfriend/
Disallow /wishlist/
Disallow /swagger/
Disallow /magento_version/
Disallow /*?dir*
Disallow /*?dir=desc
Disallow /*?dir=asc
Disallow /*?limit=all
Disallow /*?mode*
Disallow /*?SID=
Disallow /checkout/
Disallow /customer/account/
Disallow /customer/account/login/
Disallow /cgi-bin/
Disallow /cleanup.php
Disallow /apc.php
Disallow /memcache.php
Disallow /phpinfo.php
Disallow /404/
Disallow /downloader/
Disallow /errors/
Disallow /includes/
Disallow /magento/
Disallow /scripts/
Disallow /shell/
Disallow /stats/
Disallow /*.php$
Disallow /catalogsearch/result/?

Other Records

Field Value
crawl-delay 30

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

litefinder

Rule Path
Disallow /

googlebot

Rule Path
Allow /

googlebot-image

Rule Path
Allow /

slurp

Rule Path
Disallow /

baidu

Rule Path
Disallow /

asterias

Rule Path
Disallow /

backdoorbot/1.0

Rule Path
Disallow /

black hole

Rule Path
Disallow /

blowfish/1.0

Rule Path
Disallow /

botalot

Rule Path
Disallow /

builtbottough

Rule Path
Disallow /

bullseye/1.0

Rule Path
Disallow /

bunnyslippers

Rule Path
Disallow /

cegbfeieh

Rule Path
Disallow /

cheesebot

Rule Path
Disallow /

cherrypicker

Rule Path
Disallow /

cherrypickerelite/1.0

Rule Path
Disallow /

cherrypickerse/1.0

Rule Path
Disallow /

copyrightcheck

Rule Path
Disallow /

cosmos

Rule Path
Disallow /

crescent

Rule Path
Disallow /

crescent internet toolpak http ole control v.1.0

Rule Path
Disallow /

dittospyder

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

emailwolf

Rule Path
Disallow /

erocrawler

Rule Path
Disallow /

extractorpro

Rule Path
Disallow /

foobot

Rule Path
Disallow /

harvest/1.5

Rule Path
Disallow /

hloader

Rule Path
Disallow /

httplib

Rule Path
Disallow /

humanlinks

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

infonavirobot

Rule Path
Disallow /

jennybot

Rule Path
Disallow /

kenjin spider

Rule Path
Disallow /

keyword density/0.9

Rule Path
Disallow /

lexibot

Rule Path
Disallow /

libweb/clshttp

Rule Path
Disallow /

linkextractorpro

Rule Path
Disallow /

linkscan/8.1a unix

Rule Path
Disallow /

linkwalker

Rule Path
Disallow /

lnspiderguy

Rule Path
Disallow /

lwp-trivial

Rule Path
Disallow /

lwp-trivial/1.34

Rule Path
Disallow /

mata hari

Rule Path
Disallow /

microsoft url control - 5.01.4511

Rule Path
Disallow /

microsoft url control - 6.00.8169

Rule Path
Disallow /

miixpc

Rule Path
Disallow /

miixpc/4.2

Rule Path
Disallow /

mister pix

Rule Path
Disallow /

moget

Rule Path
Disallow /

moget/2.1

Rule Path
Disallow /

mozilla/4

Rule Path
Disallow /

mozilla/4.0 (compatible; bullseye; windows 95)

Rule Path
Disallow /

mozilla/4.0 (compatible; msie 4.0; windows 95)

Rule Path
Disallow /

mozilla/4.0 (compatible; msie 4.0; windows 98)

Rule Path
Disallow /

mozilla/4.0 (compatible; msie 4.0; windows nt)

Rule Path
Disallow /

mozilla/4.0 (compatible; msie 4.0; windows xp)

Rule Path
Disallow /

mozilla/4.0 (compatible; msie 4.0; windows 2000)

Rule Path
Disallow /

mozilla/4.0 (compatible; msie 4.0; windows me)

Rule Path
Disallow /

mozilla/5

Rule Path
Disallow /

netants

Rule Path
Disallow /

nicerspro

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

openfind

Rule Path
Disallow /

openfind data gathere

Rule Path
Disallow /

propowerbot/2.14

Rule Path
Disallow /

prowebwalker

Rule Path
Disallow /

queryn metasearch

Rule Path
Disallow /

repomonkey

Rule Path
Disallow /

repomonkey bait & tackle/v1.01

Rule Path
Disallow /

rma

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

spankbot

Rule Path
Disallow /

spanner

Rule Path
Disallow /

suzuran

Rule Path
Disallow /

szukacz/1.4

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

telesoft

Rule Path
Disallow /

the intraformant

Rule Path
Disallow /

thenomad

Rule Path
Disallow /

tighttwatbot

Rule Path
Disallow /

titan

Rule Path
Disallow /

tocrawl/urldispatcher

Rule Path
Disallow /

true_robot

Rule Path
Disallow /

true_robot/1.0

Rule Path
Disallow /

turingos

Rule Path
Disallow /

urly warning

Rule Path
Disallow /

vci

Rule Path
Disallow /

vci webviewer vci webviewer win32

Rule Path
Disallow /

web image collector

Rule Path
Disallow /

webauto

Rule Path
Disallow /

webbandit

Rule Path
Disallow /

webbandit/3.50

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

webenhancer

Rule Path
Disallow /

webmasterworldforumbot

Rule Path
Disallow /

websauger

Rule Path
Disallow /

website quester

Rule Path
Disallow /

webster pro

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webzip

Rule Path
Disallow /

webzip/4.0

Rule Path
Disallow /

wget

Rule Path
Disallow /

wget/1.5.3

Rule Path
Disallow /

wget/1.6

Rule Path
Disallow /

www-collector-e

Rule Path
Disallow /

xenu's

Rule Path
Disallow /

xenu's link sleuth 1.1c

Rule Path
Disallow /

zeus

Rule Path
Disallow /

zeus 32297 webster pro v2.9 win32

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

swebot

Rule Path
Disallow /

aihitbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

sitebot

Rule Path
Disallow /

yandexbot

Rule Path
Disallow /

yandex

Rule Path
Disallow /

twengabot-discover

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

bender

Rule Path
Disallow /

discobot

Rule Path
Disallow /

panscient.com

Rule Path
Disallow /

searchwebengine.net

Rule Path
Disallow /

mlbot

Rule Path
Disallow /

nextgensearchbot

Rule Path
Disallow /

speedy

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

nerdbynature.bot

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

sindicebot

Rule Path
Disallow /

plukkie

Rule Path
Disallow /

findfiles.net

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

goodzer

Rule Path
Disallow /

backlinkcrawler

Rule Path
Disallow /

lemurwebcrawler

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

discobot

Rule Path
Disallow /

fast enterprise crawler 6

Rule Path
Disallow /

sensis.com.au web crawler

Rule Path
Disallow /

worio bot heritrix

Rule Path
Disallow /

trovitbot

Rule Path
Disallow /

shoppimonagent

Rule Path
Disallow /
Disallow /*?color_group
Disallow /*?size
Disallow /*?type
Disallow /*?heel_height
Disallow /*?material
Disallow /*?division

Other Records

Field Value
sitemap https://shopjura.com/sitemap/jura/sitemap.xml
sitemap https://capresso.com/sitemap/capresso/sitemap.xml
sitemap https://shopjura.com/sitemap/jura/sitemap.xml

Comments

  • robots.txt
  • : Robots, spiders, and search engines use this file to detmine which
  • content they should *not* crawl while indexing your website.
  • : This system is called "The Robots Exclusion Standard."
  • : It is strongly encouraged to use a robots.txt validator to check
  • for valid syntax before any robots read it!
  • Examples:
  • Instruct all robots to stay out of the admin area.
  • : User-agent: *
  • : Disallow: /admin/
  • Restrict Google and MSN from indexing your images.
  • : User-agent: Googlebot
  • : Disallow: /images/
  • : User-agent: MSNBot
  • : Disallow: /images/
  • ****************************************************************************
  • robots.txt for Magento Community and Enterprise
  • GENERAL SETTINGS
  • Enable robots.txt rules for all crawlers
  • Crawl-delay parameter: number of seconds to wait between successive requests to the same server.
  • Set a custom crawl rate if you're experiencing traffic problems with your server.
  • Magento sitemap: uncomment and replace the URL to your Magento sitemap file (All stores)
  • DEVELOPMENT RELATED SETTINGS
  • Do not crawl development files and folders: CVS, svn directories and dump files
  • GENERAL MAGENTO SETTINGS
  • Do not crawl common Magento technical folders
  • Do not crawl common Magento files
  • Paths (clean URLs)
  • MAGENTO SEO IMPROVEMENTS
  • Do not crawl sub category pages that are sorted or filtered.
  • Do not crawl 2-nd home page copy (example.com/index.php/). Uncomment it only if you activated Magento SEO URLs.
  • Disallow: /index.php/
  • Do not crawl links with session IDs
  • Do not crawl checkout and user account pages
  • SERVER SETTINGS
  • Do not crawl common server technical folders and files
  • IMAGE CRAWLERS SETTINGS
  • Extra: Uncomment if you do not wish Google and Bing to index your images
  • User-agent: Googlebot-Image
  • Disallow: /
  • User-agent: msnbot-media
  • Disallow: /
  • From Inchoo Recommended robots.txt
  • http://inchoo.net/ecommerce/ultimate-magento-robots-txt-file-examples/
  • Directories
  • Disallow: /js/
  • Disallow: /media/
  • Disallow: /skin/
  • Paths (no clean URLs)
  • Disallow: /*.js$
  • Disallow: /*.css$
  • BOT BLACKLIST RELATED SETTINGS
  • too many repeated hits, too quick
  • too many repeated hits, too quick
  • From http://www.robotstxt.org/robots.txt
  • too many repeated hits, too quick
  • Google
  • Yahoo. too many repeated hits, too quick
  • too many repeated hits, too quick
  • From http://www.seobook.com/robots.txt
  • Begin block Bad-Robots from robots.txt
  • SEO-related bots
  • Bots
  • User-agent: Screaming Frog SEO Spider
  • Disallow: /

Warnings

  • 1 invalid line.