sushiya.ua
robots.txt

Robots Exclusion Standard data for sushiya.ua

Resource Scan

Scan Details

Site Domain sushiya.ua
Base Domain sushiya.ua
Scan Status Ok
Last Scan2024-09-27T08:38:08+00:00
Next Scan 2024-10-27T08:38:08+00:00

Last Scan

Scanned2024-09-27T08:38:08+00:00
URL https://sushiya.ua/robots.txt
Domain IPs 104.21.96.19, 172.67.150.67, 2606:4700:3031::6815:6013, 2606:4700:3036::ac43:9643
Response IP 172.67.150.67
Found Yes
Hash c3e4e3b394bcb75db95497d04ef61b31ba9c5b89afb5a7d010a440031571e031
SimHash f232355a4873

Groups

*

Rule Path
Disallow /administrator/
Disallow /api/
Disallow /bin/
Disallow /cache/
Disallow /cli/
Disallow /components/
Disallow /includes/
Disallow /installation/
Disallow /language/
Disallow /layouts/
Disallow /libraries/
Disallow /logs/
Disallow /modules/
Disallow /plugins/
Disallow /tmp/

*

Rule Path
Disallow /checkout
Disallow /app/
Disallow */sushi/
Disallow /lib/
Disallow /bin/
Disallow /setup/
Disallow /update/
Disallow /vendor/
Disallow /review/
Disallow /*/review/
Disallow */regions/
Disallow */stores/store/redirect/
Disallow */customer/account/login/
Disallow */?location=
Disallow */?auth_popup_open=login
Disallow */?features=
Disallow */?p=

baiduspider

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

yodaobot

Rule Path
Disallow /

sputnikbot

Rule Path
Disallow /

moget

Rule Path
Disallow /

ichiro

Rule Path
Disallow /

naverbot

Rule Path
Disallow /

yeti

Rule Path
Disallow /

obot

Rule Path
Disallow /

addthis

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

embedly

Rule Path
Disallow /

paperlibot

Rule Path
Disallow /

genieo

Rule Path
Disallow /

showyoubot

Rule Path
Disallow /

tweetmemebot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

surveybot

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

bdcbot

Rule Path
Disallow /

bdcbot/1.0

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

spbot

Rule Path
Disallow /

linkdexbot

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

haosouspider

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

grapeshot

Rule Path
Disallow /

proximic

Rule Path
Disallow /

linkpadbot

Rule Path
Disallow /

sunrise

Rule Path
Disallow /

butterfly

Rule Path
Disallow /

linguee bot

Rule Path
Disallow /

twengabot-2.0

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

amznkassocbot/4.0

Rule Path
Disallow /

nerdybot

Rule Path
Disallow /

trovitbot

Rule Path
Disallow /

shopwiki

Rule Path
Disallow /

siteexplorer

Rule Path
Disallow /

xovibot

Rule Path
Disallow /

smtbot

Rule Path
Disallow /

meanpathbot

Rule Path
Disallow /

mixrankbot

Rule Path
Disallow /

easouspider

Rule Path
Disallow /

riddler

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

swiftbot

Rule Path
Disallow /

qwantify

Rule Path
Disallow /

psbot

Rule Path
Disallow /

crazywebcrawler-spider

Rule Path
Disallow /

hypercrawl

Rule Path
Disallow /

daumoa

Rule Path
Disallow /

coccoc

Rule Path
Disallow /

netseer

Rule Path
Disallow /

backlinkcrawler

Rule Path
Disallow /

alexabot

Rule Path
Disallow /

surdotlybot

Rule Path
Disallow /

megaindex.ru/2.0

Rule Path
Disallow /

megaindex.ru

Rule Path
Disallow /

yacybot

Rule Path
Disallow /

queryseekerspider

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

deusu

Rule Path
Disallow /

tdjbot

Rule Path
Disallow /

findxbot/1.0

Rule Path
Disallow /

findxbot

Rule Path
Disallow /

y! j-asr

Rule Path
Disallow /

infoweb

Rule Path
Disallow /

nutch bots

Rule Path
Disallow /

python-urllib

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

unwindfetchor

Rule Path
Disallow /

flipboard

Rule Path
Disallow /

voltron

Rule Path
Disallow /

istellabot

Rule Path
Disallow /

silk

Rule Path
Disallow /

wget

Rule Path
Disallow /

uptimerobot

Rule Path
Disallow /

uptimerobot/2.0

Rule Path
Disallow /

screenerbot

Rule Path
Disallow /

searchmetricsbot

Rule Path
Disallow /

wesee

Rule Path
Disallow /

linkdexbot/2.0

Rule Path
Disallow /

linkdexbot/2.1

Rule Path
Disallow /

linkdexbot/2.2

Rule Path
Disallow /

uptimebot

Rule Path
Disallow /

uptimebot/1.0

Rule Path
Disallow /

lipperhey

Rule Path
Disallow /

ltx71

Rule Path
Disallow /

python-urllib/2.7

Rule Path
Disallow /

sogou web spider/4.0

Rule Path
Disallow /

baiduspider/2.0

Rule Path
Disallow /

privacyawarebot/1.1

Rule Path
Disallow /

yoozbot-2.2

Rule Path
Disallow /

wget

Rule Path
Disallow /

tweetmeme

Rule Path
Disallow /

docomo/2.0

Rule Path
Disallow /

ichiro/4.0

Rule Path
Disallow /

ichiro/3.0

Rule Path
Disallow /

twengabot-discover

Rule Path
Disallow /

twengabot/2.0

Rule Path
Disallow /

twengabot

Rule Path
Disallow /

bendercrawler

Rule Path
Disallow /

baidu

Rule Path
Disallow /

bsalsa

Rule Path
Disallow /

phpservermon/3.1.1

Rule Path
Disallow /

gluten free crawler/1.0

Rule Path
Disallow /

gluten free crawler

Rule Path
Disallow /

sogou pic spider/3.0

Rule Path
Disallow /

sogou head spider/3.0

Rule Path
Disallow /

sogou orion spider/3.0

Rule Path
Disallow /

sogou-test-spider/4.0

Rule Path
Disallow /

sogou pic agent

Rule Path
Disallow /

baiduspider+

Rule Path
Disallow /

baiduspider-video

Rule Path
Disallow /

baiduspider-image

Rule Path
Disallow /

youdaobot/1.0

Rule Path
Disallow /

changedetection

Rule Path
Disallow /

surdotlybot

Rule Path
Disallow /

libwww-perl

Rule Path
Disallow /

mojeekbot

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

Comments

  • If the Joomla site is installed within a folder
  • eg www.example.com/joomla/ then the robots.txt file
  • MUST be moved to the site root
  • eg www.example.com/robots.txt
  • AND the joomla folder name MUST be prefixed to all of the
  • paths.
  • eg the Disallow rule for the /administrator/ folder MUST
  • be changed to read
  • Disallow: /joomla/administrator/
  • For more information about the robots.txt standard, see:
  • https://www.robotstxt.org/orig.html

Warnings

  • 12 invalid lines.