vesti.lv
robots.txt

Robots Exclusion Standard data for vesti.lv

Resource Scan

Scan Details

Site Domain vesti.lv
Base Domain vesti.lv
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a server error.
Last Scan2024-06-27T12:03:47+00:00
Next Scan 2024-09-25T12:03:47+00:00

Last Successful Scan

Scanned2023-05-31T19:31:52+00:00
URL https://vesti.lv/robots.txt
Domain IPs 35.190.32.254
Response IP 35.190.32.254
Found Yes
Hash 0164f01e1b93447465f86f40906cb404e42a26bb5f6c7e3f8dedae7609a5f73f
SimHash ab7ec1b55abf

Groups

*

Rule Path
Disallow /administrator/
Disallow /cache/
Disallow /components/
Disallow /includes/
Disallow /installation/
Disallow /language/
Disallow /libraries/
Disallow /media/
Disallow /modules/
Disallow /plugins/
Disallow /templates/
Disallow /tmp/
Disallow /xmlrpc/
Disallow /win/
Disallow /phorum/

Other Records

Field Value
crawl-delay 20

mj12bot

Rule Path
Disallow /

slurp

Rule Path
Disallow /

msnbot

Rule Path
Disallow /

voilabot beta 1.2

Rule Path
Disallow /

linkedinbot/1.0

Rule Path
Disallow /

mlbot

Rule Path
Disallow /

zanrancrawler/0.2

Rule Path
Disallow /

synthesio

Rule Path
Disallow /

sheenbot/sheenbot-1.0.4

Rule Path
Disallow /

ina dlweb

Rule Path
Disallow /

lijit crawler

Rule Path
Disallow /

dtsearchspider

Rule Path
Disallow /

mj12bot/v1.3.2

Rule Path
Disallow /

voilabot

Rule Path
Disallow /

scoutjet

Rule Path
Disallow /

naverbot

Rule Path
Disallow /

yeti

Rule Path
Disallow /

isense

Rule Path
Disallow /

isense bot v1.21

Rule Path
Disallow /

spbot/2.0.2

Rule Path
Disallow /

spbot

Rule Path
Disallow /

sindicebot

Rule Path
Disallow /

aggregator:spinn3r (spinn3r 3.1)

Rule Path
Disallow /

spinn3r

Rule Path
Disallow /

konqueror/3.5

Rule Path
Disallow /

psbot/0.1

Rule Path
Disallow /

psbot

Rule Path
Disallow /

beetween/nutch-1.0 (beetween crawler)

Rule Path
Disallow /

beetween/nutch-1.0

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

tineye/1.1

Rule Path
Disallow /

speedy

Rule Path
Disallow /

datapatrol/nutch-1.0 (datapatrol indexer from garlik; http://www.garlik.com/products.php; crawler at garlik dot com)

Rule Path
Disallow /

sbider/nutch-1.0-dev (http://www.sitesell.com/sbider.html)

Rule Path
Disallow /

discobot/1.1

Rule Path
Disallow /

infoaxe./nutch-0.9

Rule Path
Disallow /

piplbot; http://pipl.com/bot/

Rule Path
Disallow /

camelhttpstream/1.0 evolution/2.26.1

Rule Path
Disallow /