riff.net.pl
robots.txt

Robots Exclusion Standard data for riff.net.pl

Resource Scan

Scan Details

Site Domain riff.net.pl
Base Domain riff.net.pl
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-07-26T04:01:37+00:00
Next Scan 2024-10-24T04:01:37+00:00

Last Successful Scan

Scanned2024-03-06T03:59:39+00:00
URL https://riff.net.pl/robots.txt
Domain IPs 104.26.14.47, 104.26.15.47, 172.67.72.229, 2606:4700:20::681a:e2f, 2606:4700:20::681a:f2f, 2606:4700:20::ac43:48e5
Response IP 104.26.15.47
Found Yes
Hash fa379136a03b3bbffc3b8b028c4c635d79476e8cb24b98f0a41d91a3c80a7891
SimHash a28e7b4a05b7

Groups

*

Rule Path
Allow */modules/*.css
Allow */modules/*.js
Allow */modules/*.png
Allow */modules/*.jpg
Allow /js/jquery/*
Disallow /*?order=
Disallow /*?tag=
Disallow /*?id_currency=
Disallow /*?search_query=
Disallow /*?back=
Disallow /*?n=
Disallow /*%26order%3D
Disallow /*%26tag%3D
Disallow /*%26id_currency%3D
Disallow /*%26search_query%3D
Disallow /*%26back%3D
Disallow /*%26n%3D
Disallow /*controller%3Daddresses
Disallow /*controller%3Daddress
Disallow /*controller%3Dauthentication
Disallow /*controller%3Dcart
Disallow /*controller%3Ddiscount
Disallow /*controller%3Dfooter
Disallow /*controller%3Dget-file
Disallow /*controller%3Dheader
Disallow /*controller%3Dhistory
Disallow /*controller%3Didentity
Disallow /*controller%3Dimages.inc
Disallow /*controller%3Dinit
Disallow /*controller%3Dmy-account
Disallow /*controller%3Dorder
Disallow /*controller%3Dorder-slip
Disallow /*controller%3Dorder-detail
Disallow /*controller%3Dorder-follow
Disallow /*controller%3Dorder-return
Disallow /*controller%3Dorder-confirmation
Disallow /*controller%3Dpagination
Disallow /*controller%3Dpassword
Disallow /*controller%3Dpdf-invoice
Disallow /*controller%3Dpdf-order-return
Disallow /*controller%3Dpdf-order-slip
Disallow /*controller%3Dproduct-sort
Disallow /*controller%3Dsearch
Disallow /*controller%3Dstatistics
Disallow /*controller%3Dattachment
Disallow /*controller%3Dguest-tracking
Disallow /app/
Disallow /cache/
Disallow /classes/
Disallow /config/
Disallow /controllers/
Disallow /download/
Disallow /js/
Disallow /localization/
Disallow /log/
Disallow /mails/
Disallow /modules/
Disallow /override/
Disallow /pdf/
Disallow /src/
Disallow /tools/
Disallow /translations/
Disallow /upload/
Disallow /var/
Disallow /vendor/
Disallow /webservice/
Disallow /pl/app/
Disallow /pl/cache/
Disallow /pl/classes/
Disallow /pl/config/
Disallow /pl/controllers/
Disallow /pl/download/
Disallow /pl/js/
Disallow /pl/localization/
Disallow /pl/log/
Disallow /pl/mails/
Disallow /pl/modules/
Disallow /pl/override/
Disallow /pl/pdf/
Disallow /pl/src/
Disallow /pl/tools/
Disallow /pl/translations/
Disallow /pl/upload/
Disallow /pl/var/
Disallow /pl/vendor/
Disallow /pl/webservice/
Disallow /*pl/odzyskiwanie-hasla
Disallow /*pl/adres
Disallow /*pl/adresy
Disallow /*pl/logowanie
Disallow /*pl/koszyk
Disallow /*pl/rabaty
Disallow /*pl/historia-zamowien
Disallow /*pl/dane-osobiste
Disallow /*pl/moje-konto
Disallow /*pl/sledzenie-zamowienia
Disallow /*pl/potwierdzenie-zwrotu
Disallow /*pl/zam%C3%B3wienie
Disallow /*pl/szukaj
Disallow /*pl/sledzenie-zamowien-gosci
Disallow /*pl/potwierdzenie-zamowienia

abonti

Rule Path
Disallow /

aggregator

Rule Path
Disallow /

asterias

Rule Path
Disallow /

bdcbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

builtbottough

Rule Path
Disallow /

bullseye

Rule Path
Disallow /

bunnyslippers

Rule Path
Disallow /

ca-crawler

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

cegbfeieh

Rule Path
Disallow /

cheesebot

Rule Path
Disallow /

cherrypicker

Rule Path
Disallow /

copyrightcheck

Rule Path
Disallow /

cosmos

Rule Path
Disallow /

crescent

Rule Path
Disallow /

discobot

Rule Path
Disallow /

dittospyder

Rule Path
Disallow /

doc

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

easouspider

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

emailwolf

Rule Path
Disallow /

erocrawler

Rule Path
Disallow /

exabot

Rule Path
Disallow /

extractorpro

Rule Path
Disallow /

fasterfox

Rule Path
Disallow /

feedbooster

Rule Path
Disallow /

foobot

Rule Path
Disallow /

genieo

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

harvest

Rule Path
Disallow /

hloader

Rule Path
Disallow /

httplib

Rule Path
Disallow /

httrack

Rule Path
Disallow /

humanlinks

Rule Path
Disallow /

ieautodiscovery

Rule Path
Disallow /

infonavirobot

Rule Path
Disallow /

istellabot

Rule Path
Disallow /

java/1.

Rule Path
Disallow /

jennybot

Rule Path
Disallow /

k2spider

Rule Path
Disallow /

kenjin spider

Rule Path
Disallow /

keyword density/0.9

Rule Path
Disallow /

larbin

Rule Path
Disallow /

lexibot

Rule Path
Disallow /

libweb

Rule Path
Disallow /

libwww

Rule Path
Disallow /

linkextractorpro

Rule Path
Disallow /

linko

Rule Path
Disallow /

linkscan/8.1a unix

Rule Path
Disallow /

linkwalker

Rule Path
Disallow /

lnspiderguy

Rule Path
Disallow /

lwp-trivial

Rule Path
Disallow /

magpie

Rule Path
Disallow /

mata hari

Rule Path
Disallow /

maxpointcrawler

Rule Path
Disallow /

megaindex

Rule Path
Disallow /

microsoft url control

Rule Path
Disallow /

miixpc

Rule Path
Disallow /

mippin

Rule Path
Disallow /

missigua locator

Rule Path
Disallow /

mister pix

Rule Path
Disallow /

moget

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

netants

Rule Path
Disallow /

nicerspro

Rule Path
Disallow /

niki-bot

Rule Path
Disallow /

npbot

Rule Path
Disallow /

nutch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

openfind

Rule Path
Disallow /

panscient.com

Rule Path
Disallow /

php/5.{

Rule Path
Disallow /

propowerbot/2.14

Rule Path
Disallow /

prowebwalker

Rule Path
Disallow /

python-urllib

Rule Path
Disallow /

queryn metasearch

Rule Path
Disallow /

repomonkey

Rule Path
Disallow /

rma

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

seznambot

Rule Path
Disallow /

sistrix

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

snappreviewbot

Rule Path
Disallow /

sogou

Rule Path
Disallow /

spankbot

Rule Path
Disallow /

spanner

Rule Path
Disallow /

spbot

Rule Path
Disallow /

spinn3r

Rule Path
Disallow /

suzuran

Rule Path
Disallow /

szukacz/1.4

Rule Path
Disallow /

teleport

Rule Path
Disallow /

telesoft

Rule Path
Disallow /

the intraformant

Rule Path
Disallow /

thenomad

Rule Path
Disallow /

tighttwatbot

Rule Path
Disallow /

titan

Rule Path
Disallow /

tocrawl/urldispatcher

Rule Path
Disallow /

true_robot

Rule Path
Disallow /

turingos

Rule Path
Disallow /

turnitinbot

Rule Path
Disallow /

ubicrawler

Rule Path
Disallow /

unisterbot

Rule Path
Disallow /

urly warning

Rule Path
Disallow /

vci

Rule Path
Disallow /

wbsearchbot

Rule Path
Disallow /

web downloader/6.9

Rule Path
Disallow /

web image collector

Rule Path
Disallow /

webauto

Rule Path
Disallow /

webbandit

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

webenhancer

Rule Path
Disallow /

webmasterworldforumbot

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

websauger

Rule Path
Disallow /

website quester

Rule Path
Disallow /

webster pro

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webzip

Rule Path
Disallow /

wotbox

Rule Path
Disallow /

wsr-agent

Rule Path
Disallow /

www-collector-e

Rule Path
Disallow /

xenu

Rule Path
Disallow /

zao

Rule Path
Disallow /

zeus

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

coccoc

Rule Path
Disallow /

incutio

Rule Path
Disallow /

lmspider

Rule Path
Disallow /

memorybot

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

serf

Rule Path
Disallow /

unknown

Rule Path
Disallow /

uptime files

Rule Path
Disallow /

rogerbot

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

sitebot

Rule Path
Disallow /

lexxebot

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

ichiro

Rule Path
Disallow /

twiceler

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

fetch

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

wget

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

archive.org_bot

Rule Path
Disallow /

ia_archiver-web.archive.org

Rule Path
Disallow /

Other Records

Field Value
sitemap https://riff.net.pl/1_index_sitemap.xml

Comments

  • robots.txt automatically generated by PrestaShop e-commerce open-source solution
  • https://www.prestashop.com - https://www.prestashop.com/forums
  • This file is to prevent the crawling and indexing of certain parts
  • of your site by web crawlers and spiders run by sites like Yahoo!
  • and Google. By telling these "robots" where not to go on your site,
  • you save bandwidth and server resources.
  • For more information about the robots.txt standard, see:
  • https://www.robotstxt.org/robotstxt.html
  • Allow Directives
  • Private pages
  • Directories for kids.izpol.pl
  • Files
  • Sitemap