Robots Exclusion Standard data for hlrnet.com

Resource Scan

Scan Details

Site Domain hlrnet.com
Base Domain hlrnet.com
Scan Status Ok
Last Scan 2024-10-25T17:34:02+00:00
Next Scan 2024-11-24T17:34:02+00:00

Last Scan

Scanned 2024-10-25T17:34:02+00:00
URL https://hlrnet.com/robots.txt
Domain IPs 192.254.236.138
Response IP 192.254.236.138
Found Yes
Hash 8d4cbe3892a3313d8fa966578c380505d83802ac49a912e663bb1d2dcc467a1f
SimHash 43d7b2f96bb7
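
The report does not say which digest the Hash field uses. Its 64 hexadecimal digits are consistent with SHA-256, so the Python sketch below assumes SHA-256 over the raw response body. The function name `body_digest` and the sample bytes are illustrative only, and the result will match the value above only if the scanner hashes the same bytes in the same way.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt response body.

    Assumption: the scanner hashes the raw bytes with SHA-256; the report
    itself does not name the algorithm.
    """
    return hashlib.sha256(body).hexdigest()


# Hypothetical two-line body, not the real file served by hlrnet.com.
sample = b"User-agent: *\nDisallow: /cv/\n"
digest = body_digest(sample)
print(len(digest), digest)
```

Comparing the digest from one scan with the digest from the next is a cheap way to tell whether the file changed at all between the two dates.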

Groups

*

Rule Path
Disallow robots.txt
Disallow /bilbao/
Disallow /cv/
Disallow /eao/
Disallow /eao.be/
Disallow /ebp1/
Disallow /ebp2/
Disallow /ebp3/
Disallow /ex/
Disallow /ej/
Disallow /encuestas/
Disallow /gezondheidsorg/
Disallow /sites/actu-en/
Disallow /sites/actu-es/
Disallow /sites/actu-fr/
Disallow /s11/
Disallow /s23/
Disallow /s42/

bot*

Rule Path
Disallow /

spider

Rule Path
Disallow /

crawl

Rule Path
Disallow /

robot

Rule Path
Disallow /

bot[+:,\.\;\/\\-]

Rule Path
Disallow /

discovery

Rule Path
Disallow /

voyager

Rule Path
Disallow /

checker

Rule Path
Disallow /

harvest

Rule Path
Disallow /

funwebproduct

Rule Path
Disallow /

scooter

Rule Path
Disallow /

naver

Rule Path
Disallow /

dumbot

Rule Path
Disallow /

hatena antenna

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

grub

Rule Path
Disallow /

looksmart

Rule Path
Disallow /

webzip

Rule Path
Disallow /

larbin

Rule Path
Disallow /

b2w/0.1

Rule Path
Disallow /

psbot

Rule Path
Disallow /

python-urllib

Rule Path
Disallow /

googlebot-image

Rule Path
Disallow /

netmechanic

Rule Path
Disallow /

url_spider_pro

Rule Path
Disallow /

cherrypicker

Rule Path
Disallow /

emailcollector

Rule Path
Disallow /

emailsiphon

Rule Path
Disallow /

webbandit

Rule Path
Disallow /

emailwolf

Rule Path
Disallow /

extractorpro

Rule Path
Disallow /

copyrightcheck

Rule Path
Disallow /

crescent

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

prowebwalker

Rule Path
Disallow /

cheesebot

Rule Path
Disallow /

lnspiderguy

Rule Path
Disallow /

mozilla

Rule Path
Disallow /

mozilla

Rule Path
Disallow /

mozilla/3

Rule Path
Disallow /

mozilla/4

Rule Path
Disallow /

mozilla/5

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

miixpc

Rule Path
Disallow /

telesoft

Rule Path
Disallow /

website quester

Rule Path
Disallow /

moget/2.1

Rule Path
Disallow /

webzip/4.0

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

websauger

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

netants

Rule Path
Disallow /

mister pix

Rule Path
Disallow /

webauto

Rule Path
Disallow /

thenomad

Rule Path
Disallow /

www-collector-e

Rule Path
Disallow /

rma

Rule Path
Disallow /

libweb/clshttp

Rule Path
Disallow /

asterias

Rule Path
Disallow /

httplib

Rule Path
Disallow /

turingos

Rule Path
Disallow /

spanner

Rule Path
Disallow /

infonavirobot

Rule Path
Disallow /

harvest/1.5

Rule Path
Disallow /

bullseye/1.0

Rule Path
Disallow /

mozilla/4.0 (compatible; bullseye; windows 95)

Rule Path
Disallow /

crescent internet toolpak http ole control v.1.0

Rule Path
Disallow /

cherrypickerse/1.0

Rule Path
Disallow /

cherrypickerelite/1.0

Rule Path
Disallow /

webbandit/3.50

Rule Path
Disallow /

nicerspro

Rule Path
Disallow /

microsoft url control - 5.01.4511

Rule Path
Disallow /

dittospyder

Rule Path
Disallow /

foobot

Rule Path
Disallow /

webmasterworldforumbot

Rule Path
Disallow /

spankbot

Rule Path
Disallow /

botalot

Rule Path
Disallow /

lwp-trivial/1.34

Rule Path
Disallow /

lwp-trivial

Rule Path
Disallow /

bunnyslippers

Rule Path
Disallow /

microsoft url control - 6.00.8169

Rule Path
Disallow /

urly warning

Rule Path
Disallow /

wget/1.6

Rule Path
Disallow /

wget/1.5.3

Rule Path
Disallow /

wget

Rule Path
Disallow /

linkwalker

Rule Path
Disallow /

cosmos

Rule Path
Disallow /

moget

Rule Path
Disallow /

hloader

Rule Path
Disallow /

humanlinks

Rule Path
Disallow /

linkextractorpro

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

mata hari

Rule Path
Disallow /

lexibot

Rule Path
Disallow /

web image collector

Rule Path
Disallow /

the intraformant

Rule Path
Disallow /

true_robot/1.0

Rule Path
Disallow /

true_robot

Rule Path
Disallow /

blowfish/1.0

Rule Path
Disallow /

jennybot

Rule Path
Disallow /

miixpc/4.2

Rule Path
Disallow /

builtbottough

Rule Path
Disallow /

propowerbot/2.14

Rule Path
Disallow /

backdoorbot/1.0

Rule Path
Disallow /

tocrawl/urldispatcher

Rule Path
Disallow /

webenhancer

Rule Path
Disallow /

suzuran

Rule Path
Disallow /

vci webviewer vci webviewer win32

Rule Path
Disallow /

vci

Rule Path
Disallow /

szukacz/1.4

Rule Path
Disallow /

queryn metasearch

Rule Path
Disallow /

openfind data gathere

Rule Path
Disallow /

openfind

Rule Path
Disallow /

zeus

Rule Path
Disallow /

repomonkey bait & tackle/v1.01

Rule Path
Disallow /

repomonkey

Rule Path
Disallow /

microsoft url control

Rule Path
Disallow /

openbot

Rule Path
Disallow /

url control

Rule Path
Disallow /

zeus link scout

Rule Path
Disallow /

zeus 32297 webster pro v2.9 win32

Rule Path
Disallow /

webster pro

Rule Path
Disallow /

erocrawler

Rule Path
Disallow /

linkscan/8.1a unix

Rule Path
Disallow /

keyword density/0.9

Rule Path
Disallow /

kenjin spider

Rule Path
Disallow /

iron33/1.0.2

Rule Path
Disallow /

bookmark search tool

Rule Path
Disallow /

getright/4.2

Rule Path
Disallow /

fairad client

Rule Path
Disallow /

gaisbot

Rule Path
Disallow /

aqua_products

Rule Path
Disallow /

radiation retriever 1.1

Rule Path
Disallow /

webmasterworld extractor

Rule Path
Disallow /

flaming attackbot

Rule Path
Disallow /

oracle ultra search

Rule Path
Disallow /

msiecrawler

Rule Path
Disallow /

perman

Rule Path
Disallow /

searchpreview

Rule Path
Disallow /

sootle

Rule Path
Disallow /

es

Rule Path
Disallow /

enterprise_search/1.0

Rule Path
Disallow /

enterprise_search

Rule Path
Disallow /

infonavirobot

Rule Path
Disallow /

tv33_mercator

Rule Path
Disallow /

avsearch

Rule Path
Disallow /

mercator

Rule Path
Disallow /

scooter

Rule Path
Disallow /

slurp

Rule Path
Disallow /

searchenginelicencesheep

Rule Path
Disallow /

shadow

Rule Path
Disallow /

multitext

Rule Path
Disallow /

htdig

Rule Path
Disallow /

spider00.logika.net

Rule Path
Disallow /

teleport pro

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

webcopier v3.2a

Rule Path
Disallow /

offline navigator

Rule Path
Disallow /

getright

Rule Path
Disallow /

freshdownload

Rule Path
Disallow /

nitro downloader

Rule Path
Disallow /

leechftp

Rule Path
Disallow /

go!zilla

Rule Path
Disallow /

da

Rule Path
Disallow /

alligator

Rule Path
Disallow /

industry program

Rule Path
Disallow /

webzip

Rule Path
Disallow /
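
To see how groups like these combine, the following sketch feeds a short excerpt to Python's standard `urllib.robotparser`. The excerpt holds only three of the groups listed above, the agent name `ExampleAgent` and the helper `allowed` are made up for illustration, and the outcome reflects how this one parser matches user agents (a case-insensitive substring test on the product token), which other crawlers may not share.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical excerpt: three groups copied from the listing above,
# not the complete file.
ROBOTS_EXCERPT = """\
User-agent: *
Disallow: /cv/
Disallow: /s11/

User-agent: wget
Disallow: /

User-agent: mozilla
Disallow: /
"""

parser = RobotFileParser()
parser.parse(ROBOTS_EXCERPT.splitlines())


def allowed(user_agent: str, path: str) -> bool:
    """Ask the parser whether user_agent may fetch path on hlrnet.com."""
    return parser.can_fetch(user_agent, "https://hlrnet.com" + path)


checks = [
    ("ExampleAgent/1.0", "/index.html"),      # falls through to the * group
    ("ExampleAgent/1.0", "/cv/resume.html"),  # blocked by the * group
    ("Wget/1.21", "/index.html"),             # matches the wget group
    ("Mozilla/5.0 (compatible; ExampleAgent/1.0)", "/index.html"),  # matches mozilla
]
for agent, path in checks:
    print(agent, path, allowed(agent, path))
```

A specific group takes precedence over `*` in this parser, so an agent that matches `wget` or `mozilla` is refused everywhere, while an unlisted agent is only kept out of the paths named in the `*` group.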

Comments

  • ACAP version=1.0
  • Disallow: /rss.xml
  • Created 150603 - edited 020921
  • User-agent: *
  • Disallow: robots.txt
  • Disallow: /afz/
  • Disallow: /boa/
  • Disallow: /ebp1/
  • Disallow: /ebp2/
  • Disallow: /ebp3/
  • Disallow: /ej/
  • Disallow: /ele/
  • Disallow: /ex/
  • Disallow: /ism/
  • Disallow: /langz/
  • Disallow: /s11/
  • Disallow: /s21/
  • Disallow: /s22/
  • Disallow: /s23/
  • Disallow: /a32/
  • Disallow: /s42/
  • Disallow: /semper/
  • Disallow: /vt/
  • Disallow: /wkr/
  • Disallow: /wv/
  • Disallow: /sitemap.htm
  • don't let search engines see the RSS feed, it's just confusing.
  • User-agent: FunWebProduct
  • Disallow: /
  • User-agent: msnbot
  • Disallow: /
  • User-agent: scooter
  • Disallow: /
  • User-agent: naver
  • Disallow: /
  • User-agent: dumbot
  • Disallow: /
  • User-agent: Hatena Antenna
  • Disallow: /
  • User-agent: grub-client
  • Disallow: /
  • User-agent: grub
  • Disallow: /
  • User-agent: looksmart
  • Disallow: /
  • User-agent: WebZip
  • Disallow: /
  • User-agent: larbin
  • Disallow: /
  • User-agent: b2w/0.1
  • Disallow: /
  • User-agent: psbot
  • Disallow: /
  • User-agent: Python-urllib
  • Disallow: /
  • User-agent: Googlebot-Image
  • Disallow: /
  • User-agent: NetMechanic
  • Disallow: /
  • User-agent: URL_Spider_Pro
  • Disallow: /
  • User-agent: CherryPicker
  • Disallow: /
  • User-agent: EmailCollector
  • Disallow: /
  • User-agent: EmailSiphon
  • Disallow: /
  • User-agent: WebBandit
  • Disallow: /
  • User-agent: EmailWolf
  • Disallow: /
  • User-agent: ExtractorPro
  • Disallow: /
  • User-agent: CopyRightCheck
  • Disallow: /
  • User-agent: Crescent
  • Disallow: /
  • User-agent: SiteSnagger
  • Disallow: /
  • User-agent: ProWebWalker
  • Disallow: /
  • User-agent: CheeseBot
  • Disallow: /
  • User-agent: LNSpiderguy
  • Disallow: /
  • User-agent: Mozilla
  • Disallow: /
  • User-agent: mozilla
  • Disallow: /
  • User-agent: mozilla/3
  • Disallow: /
  • User-agent: mozilla/4
  • Disallow: /
  • User-agent: mozilla/5
  • Disallow: /
  • User-agent: Teleport
  • Disallow: /
  • User-agent: TeleportPro
  • Disallow: /
  • User-agent: MIIxpc
  • Disallow: /
  • User-agent: Telesoft
  • Disallow: /
  • User-agent: Website Quester
  • Disallow: /
  • User-agent: moget/2.1
  • Disallow: /
  • User-agent: WebZip/4.0
  • Disallow: /
  • User-agent: WebStripper
  • Disallow: /
  • User-agent: WebSauger
  • Disallow: /
  • User-agent: WebCopier
  • Disallow: /
  • User-agent: NetAnts
  • Disallow: /
  • User-agent: Mister PiX
  • Disallow: /
  • User-agent: WebAuto
  • Disallow: /
  • User-agent: TheNomad
  • Disallow: /
  • User-agent: WWW-Collector-E
  • Disallow: /
  • User-agent: RMA
  • Disallow: /
  • User-agent: libWeb/clsHTTP
  • Disallow: /
  • User-agent: asterias
  • Disallow: /
  • User-agent: httplib
  • Disallow: /
  • User-agent: turingos
  • Disallow: /
  • User-agent: spanner
  • Disallow: /
  • User-agent: InfoNaviRobot
  • Disallow: /
  • User-agent: Harvest/1.5
  • Disallow: /
  • User-agent: Bullseye/1.0
  • Disallow: /
  • User-agent: Mozilla/4.0 (compatible; BullsEye; Windows 95)
  • Disallow: /
  • User-agent: Crescent Internet ToolPak HTTP OLE Control v.1.0
  • Disallow: /
  • User-agent: CherryPickerSE/1.0
  • Disallow: /
  • User-agent: CherryPickerElite/1.0
  • Disallow: /
  • User-agent: WebBandit/3.50
  • Disallow: /
  • User-agent: NICErsPRO
  • Disallow: /
  • User-agent: Microsoft URL Control - 5.01.4511
  • Disallow: /
  • User-agent: DittoSpyder
  • Disallow: /
  • User-agent: Foobot
  • Disallow: /
  • User-agent: WebmasterWorldForumBot
  • Disallow: /
  • User-agent: SpankBot
  • Disallow: /
  • User-agent: BotALot
  • Disallow: /
  • User-agent: lwp-trivial/1.34
  • Disallow: /
  • User-agent: lwp-trivial
  • Disallow: /
  • User-agent: BunnySlippers
  • Disallow: /
  • User-agent: Microsoft URL Control - 6.00.8169
  • Disallow: /
  • User-agent: URLy Warning
  • Disallow: /
  • User-agent: Wget/1.6
  • Disallow: /
  • User-agent: Wget/1.5.3
  • Disallow: /
  • User-agent: Wget
  • Disallow: /
  • User-agent: LinkWalker
  • Disallow: /
  • User-agent: cosmos
  • Disallow: /
  • User-agent: moget
  • Disallow: /
  • User-agent: hloader
  • Disallow: /
  • User-agent: humanlinks
  • Disallow: /
  • User-agent: LinkextractorPro
  • Disallow: /
  • User-agent: Offline Explorer
  • Disallow: /
  • User-agent: Mata Hari
  • Disallow: /
  • User-agent: LexiBot
  • Disallow: /
  • User-agent: Web Image Collector
  • Disallow: /
  • User-agent: The Intraformant
  • Disallow: /
  • User-agent: True_Robot/1.0
  • Disallow: /
  • User-agent: True_Robot
  • Disallow: /
  • User-agent: BlowFish/1.0
  • Disallow: /
  • User-agent: JennyBot
  • Disallow: /
  • User-agent: MIIxpc/4.2
  • Disallow: /
  • User-agent: BuiltBotTough
  • Disallow: /
  • User-agent: ProPowerBot/2.14
  • Disallow: /
  • User-agent: BackDoorBot/1.0
  • Disallow: /
  • User-agent: toCrawl/UrlDispatcher
  • Disallow: /
  • User-agent: WebEnhancer
  • Disallow: /
  • User-agent: suzuran
  • Disallow: /
  • User-agent: VCI WebViewer VCI WebViewer Win32
  • Disallow: /
  • User-agent: VCI
  • Disallow: /
  • User-agent: Szukacz/1.4
  • Disallow: /
  • User-agent: QueryN Metasearch
  • Disallow: /
  • User-agent: Openfind data gathere
  • Disallow: /
  • User-agent: Openfind
  • Disallow: /
  • User-agent: Zeus
  • Disallow: /
  • User-agent: RepoMonkey Bait & Tackle/v1.01
  • Disallow: /
  • User-agent: RepoMonkey
  • Disallow: /
  • User-agent: Microsoft URL Control
  • Disallow: /
  • User-agent: Openbot
  • Disallow: /
  • User-agent: URL Control
  • Disallow: /
  • User-agent: Zeus Link Scout
  • Disallow: /
  • User-agent: Zeus 32297 Webster Pro V2.9 Win32
  • Disallow: /
  • User-agent: Webster Pro
  • Disallow: /
  • User-agent: EroCrawler
  • Disallow: /
  • User-agent: LinkScan/8.1a Unix
  • Disallow: /
  • User-agent: Keyword Density/0.9
  • Disallow: /
  • User-agent: Kenjin Spider
  • Disallow: /
  • User-agent: Iron33/1.0.2
  • Disallow: /
  • User-agent: Bookmark search tool
  • Disallow: /
  • User-agent: GetRight/4.2
  • Disallow: /
  • User-agent: FairAd Client
  • Disallow: /
  • User-agent: Gaisbot
  • Disallow: /
  • User-agent: Aqua_Products
  • Disallow: /
  • User-agent: Radiation Retriever 1.1
  • Disallow: /
  • User-agent: WebmasterWorld Extractor
  • Disallow: /
  • User-agent: Flaming AttackBot
  • Disallow: /
  • User-agent: Oracle Ultra Search
  • Disallow: /
  • User-agent: MSIECrawler
  • Disallow: /
  • User-agent: PerMan
  • Disallow: /
  • User-agent: searchpreview
  • Disallow: /
  • User-agent: sootle
  • Disallow: /
  • User-agent: es
  • Disallow: /
  • User-agent: Enterprise_Search/1.0
  • Disallow: /
  • User-agent: Enterprise_Search
  • Disallow: /
  • User-agent: InfoNaviRobot
  • Disallow: /
  • User-agent: TV33_Mercator
  • Disallow: /
  • User-agent: AVSearch
  • Disallow: /
  • User-agent: Mercator
  • Disallow: /
  • User-agent: Scooter
  • Disallow: /
  • User-agent: Slurp
  • Disallow: /
  • User-agent: SearchengineLicenceSheep
  • Disallow: /
  • User-agent: shadow
  • Disallow: /
  • User-agent: MultiText
  • Disallow: /
  • User-agent: htdig
  • Disallow: /
  • User-agent: spider00.logika.net
  • Disallow: /
  • User-agent: Teleport Pro
  • Disallow: /
  • User-agent: WebCopier
  • Disallow: /
  • User-agent: WebCopier v3.2a
  • Disallow: /
  • User-agent: Offline Navigator
  • Disallow: /
  • User-agent: GetRight
  • Disallow: /
  • User-agent: FreshDownload
  • Disallow: /
  • User-agent: Nitro Downloader
  • Disallow: /
  • User-agent: LeechFTP
  • Disallow: /
  • User-agent: Go!Zilla
  • Disallow: /
  • User-agent: DA
  • Disallow: /
  • User-agent: Alligator
  • Disallow: /
  • User-agent: Industry Program
  • Disallow: /
  • User-agent: WebZip
  • Disallow: /
  • Disallow: /rss.xml
  • Created 150603 - edited 31082007

Warnings

  • `acap-crawler` is not a known field.
  • `acap-disallow-crawl` is not a known field.