aquaportail.com
robots.txt
Robots Exclusion Standard data for aquaportail.com
Resource Scan
Scan Details
Site Domain | aquaportail.com |
Base Domain | aquaportail.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-09-25T11:42:51+00:00 |
Next Scan | 2024-11-24T11:42:51+00:00 |
Last Successful Scan
Scanned | 2024-07-28T11:41:28+00:00 |
URL | https://aquaportail.com/robots.txt |
Redirect | https://www.aquaportail.com/robots.txt |
Redirect Domain | www.aquaportail.com |
Redirect Base | aquaportail.com |
Domain IPs | 91.121.38.236 |
Redirect IPs | 91.121.38.236 |
Response IP | 91.121.38.236 |
Found | Yes |
Hash | 1330ea629b4b36c88d2ebf95667294ac85f15a45c96ec351a430b619dc336ab5 |
SimHash | 63037e670a8f |
Groups
abonti
aboundex
accoona-ai-agent
acunetix
afd-verbotsverfahren
ahrefsbot
aibot
aihitbot
aipbot
alexibot
alligator
allsubmitter
alphabot
alvinetspider
anarchie
anonymization.net
antenne hatena
apache-httpclient
apexoo
arachnophilia
archive.org_bot
aspseek
asterias
aspider/0.09
attach
auresys/1.0
autoemailspider
Rule | Path |
---|---|
Disallow | / |
b2w/0.1
backdoorbot
backlink-ceck
backlink-check
backlinkcrawler
backrub/.
backstreet
backweb
badass
baiduspider
baiduspider-favo
baiduspider-news
baiduspider-video
baiduspider-image
bandit
barkrowler
batchftp
battleztar bazinga
bayspider
bbbike
bdcbot
bdfetch
becomebot/1.23
betabot
bigfoot
bitacle
big brother
bizbot003
bizbot04 kirk.overleaf.com
bizinformation
blackboard
black hole
blackwidow
blexbot
blow
blowfish
boardreader
bolt
botalot
brandprotect
brandwatch
bspider
bspider/1.0 libwww-perl/0.40
bubing
buddy
builtbottough
builtwith
bullseye
bunnyslippers
buzzsumo
bytespider
Rule | Path |
---|---|
Disallow | / |
calculon
calif univ tools
catexplorador
cazoodlebot
ccbot
cegbfeieh
cerberian
check&get
cheesebot
cherrypicker
cherrypickerelite
cherrypickerelite/1.0
cherrypickerse
cherrypickerse/1.0
chinaclaw
christcrawler
chlooe
claritybot
cliqzbot
cloud mapping
coccocbot-web
cogentbot
cognitiveseo
collector
com.plumanalytics
converacrawler
copier
copyrightcheck
copyscape
cosmos
craftbot
crawler4j
crawler.feedback
crawl.sogou.com
crazywebcrawler
crescent
crescent internet toolpak http ole control
cshttp
curious
custo
customexchangebrowser
Rule | Path |
---|---|
Disallow | / |
databasedrivermysqli
datacha0s
dblbot
demandbase-bot
demon
depspid
deusu
devil
diffbot
digincore
digitalpebble
diibot
dirbuster
disco
discobot
disco pump 3.1
discoverybot
dittospyder
dnyzbot
domainappender
domaincrawler
domainsigmacrawler
domainstatsbot
dotbot
download wonder
dragonfly
drip
dts agent
Rule | Path |
---|---|
Disallow | / |
e-societyrobot
easydl
ebingbong
ecatch
eccp/1.0
echo!
ecxi
edgeio-retriever
eirgrabber
emailcollector
email siphon
email wolf
emailsiphon
emailwolf
enigmabot
entrieva
erocrawler
evc-batch
evil
explorersearch
express webpictures
extlinksbot
extractor
extractorpro
extreme picture finder
eyenetie
ezooms
Rule | Path |
---|---|
Disallow | / |
f-bot test pilot
factbot
fdm
fhscan
filangy
fimap
findlinks
firefox/7.0
flamingo_searchengine
flashget
flunky
foobot
franklin locator 1.8
freeuploader
frontpage
furlbot
furl search
fyberspider
fyrebot
Rule | Path |
---|---|
Disallow | / |
galaxybot
genieo
germcrawler
getintent
getright
getweb
gigablast
gigabot
g-i-g-a-b-o-t
girafabot
go-ahead-got-it
gotit
gozilla
go!zilla
gqbi hnxupsxgfgnx berxjteu
grabber
grabnet
grafula
grapefx
green research, inc.
gridbot
gt::www
Rule | Path |
---|---|
Disallow | / |
haansoft
haosouspider
harvest
harvest/1.5
havij
headmasterseo
heritrix
hloader
hmview
htmlparser
http::lite
httplib
httrack
httrack 3.0
huaweisymantecspider
humanlinks
hybridbot
Rule | Path |
---|---|
Disallow | / |
iblog
ichiro
idbot
id-search
igentia
ilsebot
image fetch
image sucker
implisensebot
indeedbot
indy library
infobot
infonavirobot
infotekies
ingrid/0.1
instabid
intelliseek
interget
internet ninja
internetseer
internetvista monitor
ips-agent
iria
irlbot
iskanie
isc systems
istellabot
Rule | Path |
---|---|
Disallow | / |
java
java/1.4.2_04
java1.3.1_08
java/1.5.0
jamesbot
jbrofuzz
jennybot
jetcar
jikespider
jobo/1.3
joc web spider
joomla
jorgee
justview
jyxobot
Rule | Path |
---|---|
Disallow | / |
lanshanbot
larbin
larbin_2.6.3
leechftp
leechget
lexibot
lftp
libweb
libweb/clshttp
linguee
libwhisker
lightspeedsystems
likse
linkchecker
linkdexbot
linkextractorpro
linklooker
linkpadbot
linkscan
linkscan/8.1a unix
linksmanager
linkwalker
linqiametadatadownloaderbot
linqiarssbot
linqiascrapebot
lipperhey
litemage_walker
lmspider
lnspiderguy
ltx71 - (http://ltx71.com/)
lwp-request
lwp::simple
lwp-trivial
lwp-trivial/1.34
Rule | Path |
---|---|
Disallow | / |
mail.ru
magnet
mag-net
magpie-crawler
mail.ru_bot
majestic12
markmonitor
markwatch
masscan
mass downloader
mata hari
mauibot
meanpathbot
mediatoolkitbot
mediawords
megaindex.ru
metauri
mfc_tear_sample
microsoft data access
microsoft url control
microsoft url control - 5.01.4511
microsoft url control - 6.00.8169
midown tool
miixpc
miixpc/4.2
minirank/2.0
missigua locator
mister pix
mj12bot
mlbot
msnptc/1.0
moget
moget/2.1
morfeus fucking scanner
mozilla/picgrabber
mozilla/5.0 (compatible; heritrix/1.3.0 +http://crawler.archive.org)
mozilla/4.0 (compatible; cerberian drtrs version-3.1-build-17)
mozilla/3.0 (compatible; indy library)
mr.4x3
msfrontpage
msiecrawler
msproxy/2.0
msrabot
ms search 4.0 robot
ms search 5.0 robot
ms web services client protocol
muhstik-scan
musobot
mvaclient
Rule | Path |
---|---|
Disallow | / |
name intelligence
nameprotect
nasa search
naverbot
naverbot-1.0
navroad
nearsite
needle
nessus
netants
netattache
netcraft
netestate ne crawler
netlyzer
netmechanic
netspider
nettrack
net vampire
netvibes
netzip
newsblur
nextgensearchbot
nhsewalker
nibbler
nicebot
nicerspro
niki-bot
nikto
nimblecrawler
ninja
nmap
nomad
noxtrumbot
npbot
nutch
Rule | Path |
---|---|
Disallow | / |
obot
octopus
offline explorer
offline navigator
omniexplorer_bot
openbot
opendns domain crawler
openfind
openfind data gathere
openindexspider
openlinkprofiler
openvas
openvas
outclicksbot
outfoxbot
outfoxmelonbot
Rule | Path |
---|---|
Disallow | / |
pageanalyzer
page analyzer
pagegrabber
page scorer
pagescorer
panscient
papa foto
pavuk
pcbrowser
pecl::http
peoplepal
perplexitybot
phantom-bot
phpcrawl
picscout
picsearch
picturefinder
pimonster
piplbot
pi-monster
pinterestbot
pixray
pleasecrawl
plumanalytics
pockey
poe-component-client-http
port huron labs
probethenet
program shareware
propowerbot
propowerbot/2.14
prowebwalker
psbot
pump
pxbroker
pycurl
Rule | Path |
---|---|
Disallow | / |
rankactive
rankactivelinkbot
rankflex
rankingbot
rankingbot2
rankivabot
rankurbot
realdownload
reaper
rebelmouse
recorder
redesscrapy
reget
repomonkey
ripper
river valley inc
rma
rocketcrawler
rogerbot
rvldsmcmuwduwxnltyrm x snl
Rule | Path |
---|---|
Disallow | / |
safetynet robot 0.1
salesintelligent
sbider
scanalert
scanbot
scan.lol
scooter
scooter-3.0.fs
scoutjet
scrapy
screaming
screenerbot
searchestate
searchmetricsbot
searchpreview
semrush
semrushbot
seokicks
seolyticscrawler
seomoz
seoprofiler
seoscanners
seostats
sexsearcher
seznambot
shodan
shopwiki
sightupbot
siphon
sistrix
sitebeam
sitebot
siteexplorer
siteimprove
sitelockspider
sitesnagger
sitesucker
site sucker
sitevigil
slackbot-linkexpanding
slysearch
smartdownload
smtbot
snake
snapbot
snooper/b97_01
snoopy
socialrankiobot
sogouspider
sogou web spider
sootle
sosospider
sottopop
spacebison
spammen
spankbot
spanner
spbot
speedy
speedy spider
spinn3r
sproose
sputnikbot
sqlmap
sqlworm
sqworm
steeler
stripper
sucker
sucuri
suggybot
sumeetbot
superbot
superbot/2.6
superhttp
surfbot
surveybot
suzuran
swiftbot
sysscan
szukacz
szukacz/1.4
Rule | Path |
---|---|
Disallow | / |
t0phackteam
t8abot
takeout
teleport
teleportpro
telesoft
telesphoreo
telesphorep
the intraformant
thenomad
tighttwatbot
titan
toata
tocrawl/urldispatcher
toweyabot
tracemyfile
trackback
true_robot
true_robot/1.0
turingos
turnitin
turnitinbot
twengabot
twice
twiceler
typhoeus
Rule | Path |
---|---|
Disallow | / |
vacuum
vagabondo
vb project
vci
vci webviewer
vericitecrawler
vidiblescraper
virusdie
vivaldi
voideye
voil
voltron
voyager
Rule | Path |
---|---|
Disallow | / |
walhello appie
wallpapers/3.0
wallpapershd
wasalive-bot
wbdbot
wbsearchbot
webalta
webauto
web auto
webbandit
webbandit/3.50
webcapture
webcollage
web collage
webcopier
webdav
webenhancer
web enhancer
webfetch
web fetch
webfuck
web fuck
webgo is
webimagecollector
webleacher
wells search ii
webmasterworldforumbot
webmeup-crawler
webmirror
webpix
web pix
webreaper
websauger
web sauger
webshag
websiteextractor
websitequester
website quester
webster
webster pro
webstripper
websucker
web sucker
webwhacker
webzip
web image collector
wep search 00
wesee
wget
whack
whacker
whatweb
who.is bot
widow
wikiofeedbot
wikiwix-bot
wikiwix-bot-3.0
winhttrack
wiseguys robot
wisenutbot
wonderbot
woobot
wotbox
wprecon
wpscan
www-collector-e
www-mechanize
www::mechanize
wwwoffle
wwwster
wysigot
Rule | Path |
---|---|
Disallow | / |
x09mozilla
x22mozilla
xaldon_webspider
xaldon webspider
xenu
xenu's
xenu's link sleuth
xpymep1.exe
Rule | Path |
---|---|
Disallow | / |
*
Rule | Path |
---|---|
Disallow | /cgi-bin/ |
Disallow | /tmp/ |
Disallow | /cache/ |
Disallow | /class/ |
Disallow | /include/ |
Disallow | /install/ |
Disallow | /kernel/ |
Disallow | /language/ |
Disallow | /templates_c/ |
Disallow | /modules/newbb/post.php |
Disallow | /modules/newbb/newtopic.php |
Disallow | /modules/newbb/search.php |
Disallow | /modules/wordbook/autocomplete.php |
Disallow | /modules/aquabdd/autocomplete.php |
Disallow | /search.php |
Disallow | /google.php |
Warnings
- 6 invalid lines.