lumories.hr
robots.txt

Robots Exclusion Standard data for lumories.hr

Resource Scan

Scan Details

Site Domain lumories.hr
Base Domain lumories.hr
Scan Status Ok
Last Scan 2025-11-28T22:49:24+00:00
Next Scan 2025-12-28T22:49:24+00:00

Last Scan

Scanned 2025-11-28T22:49:24+00:00
URL https://lumories.hr/robots.txt
Redirect https://www.lumories.hr/robots.txt
Redirect Domain www.lumories.hr
Redirect Base lumories.hr
Domain IPs 34.111.187.42
Redirect IPs 34.111.187.42
Response IP 34.111.187.42
Found Yes
Hash cbb6157e2eaf2331fef30597a13b3dd74f5e05a4f55b0b123b72b91c5d7b288c
SimHash 4326f0684f9b
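
The Hash field above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest of the fetched file, although the report does not name the algorithm. Under that assumption, a minimal sketch of detecting a changed robots.txt between scans might look like the following. The function names are hypothetical and are not part of the scanner.

```python
import hashlib


def content_fingerprint(body: bytes) -> str:
    """Return a hex digest of the raw robots.txt bytes.

    Assumption: SHA-256 is used here only because the reported Hash
    has 64 hex characters; the scanner's real algorithm is not stated.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(body: bytes, previous_hash: str) -> bool:
    """Compare a freshly fetched body against the hash stored from the last scan."""
    return content_fingerprint(body) != previous_hash.lower()
```

An exact hash only signals that something changed. The shorter SimHash value is presumably there to judge how similar two versions are, which this sketch does not attempt.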

Groups

*
adidxbot

Product Comment
adidxbot Explicit mention of the Bing Ads bot, which does not adhere to the agent wildcard.
Rule Path Comment
Disallow /checkout/ -
Disallow /customer/ -
Disallow /wishlist/ -
Disallow /catalogsearch/ -
Disallow /app/ -
Disallow /lib/ -
Disallow /*.php$ -
Disallow /*SID%3D -
Disallow /index.php/ -
Disallow /banner/ajax -
Disallow /lw_related/ajax -
Disallow /documents/ -
Disallow */article/*nocache%3D -
Disallow /bloom/widget/* -
Disallow /*?*&* block irrelevant filter combinations
Disallow /*?*~* block irrelevant filter value combinations
Disallow /*?*~* block tilde
Disallow /*?*&*&*p=* block irrelevant filter combinations with pagination
Disallow /*?*~*&p=* block irrelevant filter value combinations with pagination
Disallow /*?*~*&p=* block tilde
Allow /*?*&p=* allow relevant filter with pagination
Disallow /*?*___* -
Disallow /*?*category=* -
Disallow /*?*light_bulb=* -
Disallow /*?SESSIONNAME=* -
Allow /*utm_* -
Allow /*lw_view%3Dnocontent* -
Allow /*display%3Dproducts* -
Allow /*lw_om_view%3Drecotop* -
Allow /*block_sku%3D* -
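
Several rules in this group rely on the `*` wildcard and the `$` end anchor, and Allow rules such as `/*?*&p=*` only take effect because a longer matching pattern outranks a shorter one. The sketch below illustrates that evaluation for a small subset of the rules listed above, following the longest-match convention described in RFC 9309. It is a simplified illustration, not the matcher any particular crawler uses: percent-encoding normalization is omitted, and the names `pattern_to_regex`, `is_allowed`, and `RULES` are hypothetical.

```python
import re

# A small subset of the rules from the table above (assumption: chosen for illustration).
RULES = [
    ("disallow", "/checkout/"),
    ("disallow", "/*.php$"),
    ("disallow", "/*?*&*"),
    ("allow", "/*?*&p=*"),
    ("allow", "/*utm_*"),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into an anchored regular expression.

    '*' matches any run of characters; a trailing '$' anchors the end of the path.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Apply the longest matching rule; on a length tie, Allow wins."""
    best = None  # (pattern length, is_allow)
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # A path that matches no rule is allowed by default.
    return True if best is None else best[1]
```

With this subset, `/lamps?color=red&size=5` is blocked by `/*?*&*`, while `/lamps?color=red&p=2` is allowed because `/*?*&p=*` is the longer match.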

amazonbot
blexbot
barkrowler
buck
genomecrawlerd
halobot
iboubot
imagesiftbot
pingdom
pingdom.com_bot
pingdompagespeed
sebot-wa
sansec
sansec security monitor
scrapy
semrushbot
thinkbot
timpibot
twitterbot
uptimerobot
velenpublicwebcrawler
ahrefs
dataforseo
dotbot
keydrop
libredtail-http
megaindex
mj12bot
petalbot
python-requests
semrush
serpstatbot
snapchat
sqlmap
trendictionbot
webmeup-crawler
yandex
zgrab

Rule Path
Disallow *

gptbot
applebot

Rule Path
Disallow /*?*p=*

adsbot-google-mobile

Rule Path
Disallow /bloom/widget/*

adsbot-google

Rule Path
Disallow /bloom/widget/*

mediapartners-google

Rule Path
Disallow /bloom/widget/*

googlebot-image

Rule Path
Disallow /bloom/widget/*

googlebot-video

Rule Path
Disallow /bloom/widget/*

pinterestbot

No rules defined. All paths allowed.
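
The groups above only apply to the crawlers they name, and every other crawler falls back to the `*` group. A crawler with its own group, even an empty one like pinterestbot, does not inherit the `*` rules. A minimal sketch of that selection step follows. The group labels are hypothetical shorthand for the rule sets listed above, only a few agents are included, and substring matching against a full User-Agent header is a simplification of how a crawler compares its own product token.

```python
# Hypothetical labels standing in for the rule sets listed in this report.
GROUP_OF = {
    "adidxbot": "storefront rules",   # shares the rules of the '*' group
    "semrushbot": "blocked",          # Disallow *
    "gptbot": "no pagination",        # Disallow /*?*p=*
    "applebot": "no pagination",
    "googlebot-image": "no widgets",  # Disallow /bloom/widget/*
    "pinterestbot": "unrestricted",   # group present, no rules defined
}
DEFAULT_GROUP = "storefront rules"    # the '*' group


def select_group(user_agent: str) -> str:
    """Pick the group for a crawler; matching is case-insensitive."""
    ua = user_agent.lower()
    for token, group in GROUP_OF.items():
        if token in ua:
            return group
    return DEFAULT_GROUP
```

For example, `select_group("Pinterestbot/1.0")` returns `"unrestricted"`, whereas an agent that is not listed receives the `*` group's rules.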

Other Records

Field Value
crawl-delay 1

Other Records

Field Value
sitemap https://www.lumories.hr/media/sitemap/sitemap_hr_hr.xml

Comments

  • Disallow not needed crawlers
  • Disallow pagination crawls where not needed
  • Disallow widget crawls