siterice.hr
robots.txt

Robots Exclusion Standard data for siterice.hr

Resource Scan

Scan Details

Site Domain siterice.hr
Base Domain siterice.hr
Scan Status Ok
Last Scan2024-09-19T18:24:27+00:00
Next Scan 2024-10-03T18:24:27+00:00

Last Scan

Scanned2024-09-19T18:24:27+00:00
URL https://siterice.hr/robots.txt
Redirect https://www.siterice.hr/robots.txt
Redirect Domain www.siterice.hr
Redirect Base siterice.hr
Domain IPs 104.21.95.24, 172.67.169.73, 2606:4700:3032::6815:5f18, 2606:4700:3036::ac43:a949
Redirect IPs 104.21.95.24, 172.67.169.73, 2606:4700:3032::6815:5f18, 2606:4700:3036::ac43:a949
Response IP 172.67.169.73
Found Yes
Hash 05917040fb17be04c74833352648d45d3b6d9de90e4df310b9344411ac90c935
SimHash 6defe422053b

Groups

*

Rule Path Comment
Disallow /auth/facebook Authentication
Disallow /auth/mojeid Authentication
Disallow /auth/mojeid-oidc Authentication
Disallow /*?cancel_time= City overview action
Disallow /*?local= Language switcher
Disallow /clanstvo -
Disallow /rezervacije -
Disallow /poruke-siterice -
Disallow /poruke/new -
Disallow /en/orders -
Disallow /en/bookings -
Disallow /en/hlidacka-zpravy -
Disallow /en/zpravy/new -

xovibot

Rule Path
Disallow /

heritrix

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

seoscanners.net/1

Rule Path
Disallow /

seoscanners.net

Rule Path
Disallow /

seoscanners

Rule Path
Disallow /

baiduspider

Rule Path
Disallow /

baiduspider/2.0

Rule Path
Disallow /

ltx71

Rule Path
Disallow /

yandex

Rule Path
Allow /$
Disallow /

dotbot

Rule Path
Disallow /

mediatoolkitbot

Rule Path
Disallow /

smtbot

Rule Path
Disallow /

indeedbot

Rule Path
Disallow /

zoominfobot (zoominfobot at zoominfo dot com)

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

spiderling

Rule Path
Disallow /

metasr

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

sogou inst spider

Rule Path
Disallow /

seekport crawler

Rule Path
Disallow /

sputnikbot/2.3

Rule Path
Disallow /

megaindex.ru/2.0

Rule Path
Disallow /

barkrowler

Rule Path
Disallow /

tpradstxtcrawler

Rule Path
Disallow /

garlikcrawler

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

aspiegelbot

Rule Path
Disallow /

appnexusadstxtcrawler

Rule Path
Disallow /

adstxtcrawler

Rule Path
Disallow /

pimeyes.com

Rule Path
Disallow /

velenpublicwebcrawler

Rule Path
Disallow /

crawler4j

Rule Path
Disallow /

mauibot (crawler.feedback+wc@gmail.com)

Rule Path
Disallow /

punkspider

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.siterice.hr/sitemap_index.xml.gz

Comments

  • Non translated URLs
  • Translated URLs
  • Locale hr
  • Locale en