sprzegla24.pl
robots.txt

Robots Exclusion Standard data for sprzegla24.pl

Resource Scan

Scan Details

Site Domain sprzegla24.pl
Base Domain sprzegla24.pl
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-09-14T19:02:48+00:00
Next Scan 2024-11-13T19:02:48+00:00

Last Successful Scan

Scanned2024-06-24T19:01:44+00:00
URL https://sprzegla24.pl/robots.txt
Domain IPs 185.11.102.17
Response IP 185.11.102.17
Found Yes
Hash e19bf63093e1c93054b3b4404dd7f5b547bc8234c0c6fa853682114824db926e
SimHash 735f736b0321

Groups

*

Rule Path
Allow */modules/*.css
Allow */modules/*.js
Allow */modules/*.png
Disallow /*?orderby=
Disallow /*?orderway=
Disallow /*?tag=
Disallow /*?id_currency=
Disallow /*?search_query=
Disallow /*?back=
Disallow /*?n=
Disallow /*%26orderby%3D
Disallow /*%26orderway%3D
Disallow /*%26tag%3D
Disallow /*%26id_currency%3D
Disallow /*%26search_query%3D
Disallow /*%26back%3D
Disallow /*%26n%3D
Disallow /*controller%3Daddresses
Disallow /*controller%3Daddress
Disallow /*controller%3Dauthentication
Disallow /*controller%3Dcart
Disallow /*controller%3Ddiscount
Disallow /*controller%3Dfooter
Disallow /*controller%3Dget-file
Disallow /*controller%3Dheader
Disallow /*controller%3Dhistory
Disallow /*controller%3Didentity
Disallow /*controller%3Dimages.inc
Disallow /*controller%3Dinit
Disallow /*controller%3Dmy-account
Disallow /*controller%3Dorder
Disallow /*controller%3Dorder-opc
Disallow /*controller%3Dorder-slip
Disallow /*controller%3Dorder-detail
Disallow /*controller%3Dorder-follow
Disallow /*controller%3Dorder-return
Disallow /*controller%3Dorder-confirmation
Disallow /*controller%3Dpagination
Disallow /*controller%3Dpassword
Disallow /*controller%3Dpdf-invoice
Disallow /*controller%3Dpdf-order-return
Disallow /*controller%3Dpdf-order-slip
Disallow /*controller%3Dproduct-sort
Disallow /*controller%3Dsearch
Disallow /*controller%3Dstatistics
Disallow /*controller%3Dattachment
Disallow /*controller%3Dguest-tracking
Disallow */classes/
Disallow */config/
Disallow */download/
Disallow */mails/
Disallow */modules/
Disallow */translations/
Disallow */tools/
Disallow /*odzyskiwanie-hasla
Disallow /*adres
Disallow /*adresy
Disallow /*logowanie
Disallow /*koszyk
Disallow /*rabaty
Disallow /*historia-zamowien
Disallow /*dane-osobiste
Disallow /*moje-konto
Disallow /*sledzenie-zamowienia
Disallow /*pokwitowania
Disallow /*zamowienie
Disallow /*szukaj
Disallow /*szybkie-zakupy
Disallow /*sledzenie-zamowienia-gosc
Disallow /*potwierdzenie-zamowienia

aboundex
accelobot
add\ catalog
ahrefsbot
aihitbot
asterias
awcheckbot
backdoorbot/1.0
backlinkcrawler
baiduspider
black hole
blowfish/1.0
botalot
brandwatch\.net
builtbottough
bullseye/1.0
bunnyslippers
butterfly
catchbot
cegbfeieh
charlotte
cheesebot
cherrypicker
cherrypickerelite/1.0
cherrypickerse/1.0
clipish
comodo
comodo-certificates-spider
compspybot
copyrightcheck
cosmos
crescent
crescent internet toolpak http ole control v.1.0
curious
dataprovider\.com
dinoping
discoverybot
dittospyder
dotbot
dotnetdotcom
dow\ jones\ searchbot
easouspider
emailcollector
emailsiphon
emailwolf
erocrawler
exabot
extractorpro
ezinearticleslinkscanner
ezooms
foobot
ftrf\:\ friendly
gigabot
harvest/1.5
hloader
httplib
humanlinks
ia_archiver
indy\ library
infonavirobot
ip\-web\-crawler\.com
jakarta\ commons-httpclient
jennybot
jikespider
kenjin spider
keyword density/0.9
lexibot
libweb/clshttp
libwww-perl
lindex\.com
linkdex\.com
linkextractorpro
linkscan/8.1a unix
linkwalker
lipperhey
lnspiderguy
ltbot
lwp-trivial
lwp-trivial/1.34
magpie\-crawler
mata hari
microsoft url control - 5.01.4511
microsoft url control - 6.00.8169
miixpc
miixpc/4.2
mister pix
moget
moget/2.1
mozilla/4
mozilla/4.0 (compatible; bullseye; windows 95)
mozilla/4.0 (compatible; msie 4.0; windows 2000)
mozilla/4.0 (compatible; msie 4.0; windows 95)
mozilla/4.0 (compatible; msie 4.0; windows 98)
mozilla/4.0 (compatible; msie 4.0; windows me)
mozilla/4.0 (compatible; msie 4.0; windows nt)
mozilla/4.0 (compatible; msie 4.0; windows xp)
mozilla/5
msie\ or\ firefox\ mutant
ncbot
netants
netcraftsurveyagent
netestate\ ne\ crawler
netseer
nextgensearchbot
nicerspro
ocelli
offline explorer
openfind
openfind data gathere
openwebindex
pagesinventory
peoplepal
procogseobot
propowerbot/2.14
prowebwalker
proximic
purebot
queryn metasearch
queryseekerspider
repomonkey
repomonkey bait & tackle/v1.01
riddler
rma
rogerbot
rojerbot
screenerbot
searchmetrics
semrushbot
seoengworldbot
shopwiki
sistrix
sitebot
sitesnagger
snoopy
socialsearcher
sogou
solomonobot
sosospider
spankbot
spanner
speedy
surveybot
suzuran
szukacz/1.4
teleport
teleportpro
telesoft
the intraformant
the\ incutio\ xml-rpc\ php\ library
thenomad
tighttwatbot
titan
tocrawl/urldispatcher
true_robot
true_robot/1.0
turingos
turnitinbot
urly warning
vci
vci webviewer vci webviewer win32
visaduhoc\.info
wbsearchbot
web image collector
webauto
webbandit
webbandit/3.50
webcapture
webcopier
webenhancer
webindetail\.com
webmasterworldforumbot
websauger
website quester
websitetheweb\.com
webster pro
webstripper
webzip
webzip/4.0
wget
wget/1.5.3
wget/1.6
wotbot
www\.integromedb\.org
www-collector-e
xenu's
xenu's link sleuth 1.1c
xpymep\.exe
yamanalab-robot
yisouspider
yodaobot
youdaobot
zend_http_client
zeus
zeus 32297 webster pro v2.9 win32
zmeu
zumbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.sprzegla24.pl/sitemap.xml

Comments

  • robots.txt automaticaly generated by PrestaShop e-commerce open-source solution
  • http://www.prestashop.com - http://www.prestashop.com/forums
  • This file is to prevent the crawling and indexing of certain parts
  • of your site by web crawlers and spiders run by sites like Yahoo!
  • and Google and Bing. By telling these "robots" where not to go on your site,
  • you save bandwidth and server resources..
  • For more information about the robots.txt standard, see:
  • http://www.robotstxt.org/robotstxt.html
  • Allow Directives
  • Private pages
  • Directories
  • Files

Warnings

  • 18 invalid lines.