xxlgastro.pl
robots.txt

Robots Exclusion Standard data for xxlgastro.pl

Resource Scan

Scan Details

Site Domain xxlgastro.pl
Base Domain xxlgastro.pl
Scan Status Ok
Last Scan2024-11-06T00:25:04+00:00
Next Scan 2024-12-06T00:25:04+00:00

Last Scan

Scanned2024-11-06T00:25:04+00:00
URL https://xxlgastro.pl/robots.txt
Redirect https://www.xxlgastro.pl/robots.txt
Redirect Domain www.xxlgastro.pl
Redirect Base xxlgastro.pl
Domain IPs 104.16.8.49, 104.17.156.30
Redirect IPs 104.16.8.49, 104.17.156.30, 2606:4700::6810:831, 2606:4700::6811:9c1e
Response IP 104.16.8.49
Found Yes
Hash 275af5db7bd5b47c3e7ac02131d6a3d03737e72b4320b2cec2b24361f04bdfd2
SimHash c34bfa6f2997

Groups

*

Rule Path
Disallow /admin
Disallow /en/account/
Disallow /pl/account/
Disallow /en/cart/
Disallow /pl/cart/
Disallow /en/compare/
Disallow /pl/compare/
Disallow /en/checkout/
Disallow /pl/checkout/

Other Records

Field Value
crawl-delay 2

abonti
aboundex
acoonbot
acunetix
adbeat_bot
addthis.com
adidxbot
admantx
ahrefsbot
aibot
aihitbot
alexibot
alligator
allsubmitter
angloinfo
antelope
apexoo
asterias
attach
backdoorbot
backstreet
backweb
badass
baid
baiduspider
baiduspider
bandit
batchftp
bbbike
beetlebot
bigfoot
billigerbot
binlar
bitlybot
black.hole
blackwidow
blexbot
blow
blowfish
blp_bbot
boardreader
bolt 0
bot for jce
bot mailto:craftbot@yahoo.com
botalot
buddy
builtbottough
bullseye
bunnyslippers
casper
cazoodlebot
ccbot
cegbfeieh
checkprivacy
cheesebot
cherrypicker
chinaclaw
chromeframe
clerkbot
cliqzbot
clshttp
cogentbot
cognitiveseo
collector
commoncrawler
comodo
copier
copyrightcheck
cosmos
cpython
crawler4j
crawlera
crazywebcrawler
crescent
cshttp
curious
curl
custo
cws_proxy
default browser 0
demon
deusu
devil
diavol
digext
digincore
diibot
disco
discobot
dittospyder
docomo
dotbot
download.demon
download.devil
download.wonder
download demo
dragonfly
drip
dts.agent
easouspider
easydl
ebingbong
ecatch
ecxi
eirgrabber
elmer
emailcollector
emailsiphon
emailwolf
erocrawler
exabot
exaleadcloudview
expertsearch
expertsearchspider
express
express webpictures
extract
extractor
extractorpro
eyenetie
ezooms
f2s
fastseek
feedfinder
feedlybot
fhscan
finbot
flamingo_searchengine
flappybot
flashget
flicky
flipboard
flipboardproxy
flunky
foobot
frontpage
g00g1e
galaxybot
genieo
genieo
getright
getweb!
gigablastopensource
go-ahead-got-it
go!zilla
gotit
gozaikbot
grab
grabber
grabnet
grafula
grapeshotcrawler
gt::www
gtb5
guzzle
harvest
harvest
headmasterseo
heritrix
hloader
hmview
homepagebot
htmlparser
http::lite
httrack
httrack
hubspot
humanlinks
ia_archiver
icarus6
id-search
idbot
ilsebot
image.stripper
image.sucker
image stripper
image sucker
imagefetch
indigonet
indy library
infonavirobot
infotekies
integromedb
intelliseek
interget
internet ninja
internetseer.com
iria
irlbot
isc systems irc search 2.1
jakarta
jakarta
java
jennybot
jetcar
jikespider
jobdiggerspider
joc
joc web spider
jooblebot
justview
jyxobot
kanagawa
kenjin.spider
keyword.density
kingspider
kmccrew
larbin
leechftp
leechget
lexibot
lftp
libweb
libwww
libwww-perl
likse
lingewoud
linkchecker
linkdexbot
linkextractorpro
linkscan
linkscrawler
linksmanager.com_bot
linkwalker
linkwalker
linqiarssbot
livelapbot
lnspiderguy
ltx71
lubbersbot
lwp-trivial
mag-net
magnet
mail.ru_bot
majestic12
markwatch
mass.downloader
mass downloader
masscan
mata.hari
maverick
maxthon$
mediatoolkitbot
megaindex
megaindex
memo
metauri
mfc_tear_sample
microsoft url control
microsoft.url
midown tool
miixpc
miner
missigua locator
mister pix
mj12bot
mozilla.*indy
mozilla.*newt
msfrontpage
msiecrawler
msnbot
nameprotect
navroad
nearsite
net vampire
netants
netcraft
netestate
netmechanic
netspider
netzip
nextgensearchbot
nicerspro
niki-bot
nimblecrawler
nimbostratus-bot
ninja
nmap
nmap
npbot
nutch
octopus
offline.explorer
offline.navigator
offline explorer
offline navigator
openfind
openindexspider
openlinkprofiler
openwebspider
orangebot
outfoxbot
owlin
pagegrabber
pagesinventory
panopta
panscient.com
papa foto
pavuk
pcbrowser
pecl::http
peoplepal
photon
phpcrawl
pixray
planetwork
pleasecrawl
pnamain.exe
pockey
podcastpartybot
prijsbest
probethenet
propowerbot
prowebwalker
proximic
psbot
pump
purebot
pycurl
python-requests
queryn.metasearch
queryseekerspider
r6_commentreader
r6_feedfetcher
realdownload
reaper
recorder
reget
repomonkey
riddler
ripper
rippers 0
rma
rogerbot
rssingbot
rv:1.9.1
ryzecrawler
safesearch
sbider
scanbot
scrapy
screaming
seamonkey$
search_robot
search.goo.ne.jp
searchmetricsbot
semrush
semrushbot
sentibot
seokicks
seokicks-robot
seoscanners
seznambot
showyoubot
sightupbot
siphon
sistrix
sitecheck.internetseer.com
siteexplorer.info
siteimprove
sitesnagger
sitesucker
skygrid
slackbot
slurp
slysearch
smartdownload
snake
snapbot
snoopy
sogou
sogou
sosospider
spacebison
spankbot
spanner
spaumbot
spbot
spinn4r
sqworm
steeler
stripper
sucker
sucker
superbot
superfeedr
superhttp
surdotlybot
surfbot
suzuran
szukacz
takeout
teleport
teleport pro
telesoft
the.intraformant
thenomad
tighttwatbot
tineye
tineye-bot
titan
toata dragostea mea pentru diavola
toplistbot
trendictionbot
trovitbot
true_robot
turingos
turnit
turnitinbot
twitterbot
uri::fetch
urllib
urly.warning
vacuum
vagabondo
vci
vidiblescraper
vikspider
voideye
voilabot
wallpapershd
wbsearchbot
web.image.collector
web image collector
web sucker
webalta
webauto
webbandit
webcollage
webcopier
webenhancer
webfetch
webfuck
webgo is
webleacher
webmasterworldforumbot
webpix
webreaper
websauger
webshag
website.extractor
website extractor
website quester
webster
webstripper
websucker
webwhacker
webzip
wells search ii
wep search
wesee
wget
whack
whacker
widow
winhttrack
wininet
wisenutbot
woobot
woopingbot
worldwebheritage.org
wotbox
wpscan
www-collector-e
www-mechanize
wwwoffle
xaldon
xaldon webspider
xenu
xovibot
yacybot
yandex
yandexbot
yisouspider
zade
zermelo
zeus
zh-cn
zmeu
zumbot
zyborg
zyborg

Rule Path
Disallow /
Disallow */go/product/
Disallow */go/category/
Disallow *sort%3D
Disallow *max%3D
Disallow *page
Disallow *brand%3D
Disallow *min%3D
Disallow /*?filter=
Disallow /*?limit=
Disallow /*?mode=

*

Rule Path
Disallow */go/product/
Disallow */go/category/
Disallow /pl/service/shipping-returns/
Disallow /pl/service/payment-methods/
Disallow /pl/service/general-terms-conditions/
Disallow /pl/service/privacy-policy/
Disallow /pl/account/password/
Disallow /pl/search*
Disallow */?sort=*
Disallow */services/
Disallow */search/
Disallow */cdn-cgi/
Disallow /*htm$

Other Records

Field Value
sitemap https://www.xxlgastro.pl/sitemap.xml

Warnings

  • 2 invalid lines.