aquaportail.com
robots.txt

Robots Exclusion Standard data for aquaportail.com

Resource Scan

Scan Details

Site Domain aquaportail.com
Base Domain aquaportail.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-09-25T11:42:51+00:00
Next Scan 2024-11-24T11:42:51+00:00

Last Successful Scan

Scanned 2024-07-28T11:41:28+00:00
URL https://aquaportail.com/robots.txt
Redirect https://www.aquaportail.com/robots.txt
Redirect Domain www.aquaportail.com
Redirect Base aquaportail.com
Domain IPs 91.121.38.236
Redirect IPs 91.121.38.236
Response IP 91.121.38.236
Found Yes
Hash 1330ea629b4b36c88d2ebf95667294ac85f15a45c96ec351a430b619dc336ab5
SimHash 63037e670a8f
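
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest of the fetched file. The report does not name the algorithm, so SHA-256 is an assumption here, and the sample body below is a placeholder rather than the real file. A minimal sketch of how such a content hash could be computed to detect changes between scans:

```python
import hashlib


def content_hash(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body.

    Assumption: the report's "Hash" field is a SHA-256 digest; the source
    does not confirm which algorithm the scanner uses.
    """
    return hashlib.sha256(body).hexdigest()


# Placeholder content, not the actual aquaportail.com file.
sample_body = b"User-agent: *\nDisallow: /tmp/\n"
digest = content_hash(sample_body)

# A SHA-256 hex digest always has 64 characters, like the value in the report.
print(len(digest), digest)
```

Comparing the digest from one scan with the digest from the next is a cheap way to tell whether the file changed at all, without diffing its contents.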

Groups

gptbot
chatgpt
voltron
amazonbot

Rule Path
Disallow /
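
The group above lists four user agents that share a single `Disallow /` rule, which blocks them from the whole site. As a rough sketch, the group can be rewritten in robots.txt syntax and checked with Python's standard `urllib.robotparser`. The file text below is reconstructed from the report, so its exact spelling and casing are assumptions, and `examplebrowser` is a hypothetical agent used only for contrast.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the first group in the report (assumed formatting).
ROBOTS_TXT = """\
User-agent: gptbot
User-agent: chatgpt
User-agent: voltron
User-agent: amazonbot
Disallow: /
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text held in memory instead of fetching it."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(ROBOTS_TXT)

# "examplebrowser" is hypothetical and matches no group in this sketch.
for agent in ("gptbot", "amazonbot", "examplebrowser"):
    allowed = parser.can_fetch(agent, "https://www.aquaportail.com/")
    print(f"{agent}: {'allowed' if allowed else 'blocked'}")
```

Note that `urllib.robotparser` matches agent names case-insensitively and by substring, which may differ from how a particular crawler interprets the same file.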

abonti
aboundex
accoona-ai-agent
acunetix
afd-verbotsverfahren
ahrefsbot
aibot
aihitbot
aipbot
alexibot
alligator
allsubmitter
alphabot
alvinetspider
anarchie
anonymization.net
antenne hatena
apache-httpclient
apexoo
arachnophilia
archive.org_bot
aspseek
asterias
aspider/0.09
attach
auresys/1.0
autoemailspider

Rule Path
Disallow /

b2w/0.1
backdoorbot
backlink-ceck
backlink-check
backlinkcrawler
backrub/.
backstreet
backweb
badass
baiduspider
baiduspider-favo
baiduspider-news
baiduspider-video
baiduspider-image
bandit
barkrowler
batchftp
battleztar bazinga
bayspider
bbbike
bdcbot
bdfetch
becomebot/1.23
betabot
bigfoot
bitacle
big brother
bizbot003
bizbot04 kirk.overleaf.com
bizinformation
blackboard
black hole
blackwidow
blexbot
blow
blowfish
boardreader
bolt
botalot
brandprotect
brandwatch
bspider
bspider/1.0 libwww-perl/0.40
bubing
buddy
builtbottough
builtwith
bullseye
bunnyslippers
buzzsumo
bytespider

Rule Path
Disallow /

calculon
calif univ tools
catexplorador
cazoodlebot
ccbot
cegbfeieh
cerberian
check&get
cheesebot
cherrypicker
cherrypickerelite
cherrypickerelite/1.0
cherrypickerse
cherrypickerse/1.0
chinaclaw
christcrawler
chlooe
claritybot
cliqzbot
cloud mapping
coccocbot-web
cogentbot
cognitiveseo
collector
com.plumanalytics
converacrawler
copier
copyrightcheck
copyscape
cosmos
craftbot
crawler4j
crawler.feedback
crawl.sogou.com
crazywebcrawler
crescent
crescent internet toolpak http ole control
cshttp
curious
custo
customexchangebrowser

Rule Path
Disallow /

databasedrivermysqli
datacha0s
dblbot
demandbase-bot
demon
depspid
deusu
devil
diffbot
digincore
digitalpebble
diibot
dirbuster
disco
discobot
disco pump 3.1
discoverybot
dittospyder
dnyzbot
domainappender
domaincrawler
domainsigmacrawler
domainstatsbot
dotbot
download wonder
dragonfly
drip
dts agent

Rule Path
Disallow /

e-societyrobot
easydl
ebingbong
ecatch
eccp/1.0
echo!
ecxi
edgeio-retriever
eirgrabber
emailcollector
email siphon
email wolf
emailsiphon
emailwolf
enigmabot
entrieva
erocrawler
evc-batch
evil
explorersearch
express webpictures
extlinksbot
extractor
extractorpro
extreme picture finder
eyenetie
ezooms

Rule Path
Disallow /

f-bot test pilot
factbot
fdm
fhscan
filangy
fimap
findlinks
firefox/7.0
flamingo_searchengine
flashget
flunky
foobot
franklin locator 1.8
freeuploader
frontpage
furlbot
furl search
fyberspider
fyrebot

Rule Path
Disallow /

galaxybot
genieo
germcrawler
getintent
getright
getweb
gigablast
gigabot
g-i-g-a-b-o-t
girafabot
go-ahead-got-it
gotit
gozilla
go!zilla
gqbi hnxupsxgfgnx berxjteu
grabber
grabnet
grafula
grapefx
green research, inc.
gridbot
gt::www

Rule Path
Disallow /

haansoft
haosouspider
harvest
harvest/1.5
havij
headmasterseo
heritrix
hloader
hmview
htmlparser
http::lite
httplib
httrack
httrack 3.0
huaweisymantecspider
humanlinks
hybridbot

Rule Path
Disallow /

iblog
ichiro
idbot
id-search
igentia
ilsebot
image fetch
image sucker
implisensebot
indeedbot
indy library
infobot
infonavirobot
infotekies
ingrid/0.1
instabid
intelliseek
interget
internet ninja
internetseer
internetvista monitor
ips-agent
iria
irlbot
iskanie
isc systems
istellabot

Rule Path
Disallow /

java
java/1.4.2_04
java1.3.1_08
java/1.5.0
jamesbot
jbrofuzz
jennybot
jetcar
jikespider
jobo/1.3
joc web spider
joomla
jorgee
justview
jyxobot

Rule Path
Disallow /

kenjin spider
keyword density
kozmosbot

Rule Path
Disallow /

lanshanbot
larbin
larbin_2.6.3
leechftp
leechget
lexibot
lftp
libweb
libweb/clshttp
linguee
libwhisker
lightspeedsystems
likse
linkchecker
linkdexbot
linkextractorpro
linklooker
linkpadbot
linkscan
linkscan/8.1a unix
linksmanager
linkwalker
linqiametadatadownloaderbot
linqiarssbot
linqiascrapebot
lipperhey
litemage_walker
lmspider
lnspiderguy
ltx71 - (http://ltx71.com/)
lwp-request
lwp::simple
lwp-trivial
lwp-trivial/1.34

Rule Path
Disallow /

mail.ru
magnet
mag-net
magpie-crawler
mail.ru_bot
majestic12
markmonitor
markwatch
masscan
mass downloader
mata hari
mauibot
meanpathbot
mediatoolkitbot
mediawords
megaindex.ru
metauri
mfc_tear_sample
microsoft data access
microsoft url control
microsoft url control - 5.01.4511
microsoft url control - 6.00.8169
midown tool
miixpc
miixpc/4.2
minirank/2.0
missigua locator
mister pix
mj12bot
mlbot
msnptc/1.0
moget
moget/2.1
morfeus fucking scanner
mozilla/picgrabber
mozilla/5.0 (compatible; heritrix/1.3.0 +http://crawler.archive.org)
mozilla/4.0 (compatible; cerberian drtrs version-3.1-build-17)
mozilla/3.0 (compatible; indy library)
mr.4x3
msfrontpage
msiecrawler
msproxy/2.0
msrabot
ms search 4.0 robot
ms search 5.0 robot
ms web services client protocol
muhstik-scan
musobot
mvaclient

Rule Path
Disallow /

name intelligence
nameprotect
nasa search
naverbot
naverbot-1.0
navroad
nearsite
needle
nessus
netants
netattache
netcraft
netestate ne crawler
netlyzer
netmechanic
netspider
nettrack
net vampire
netvibes
netzip
newsblur
nextgensearchbot
nhsewalker
nibbler
nicebot
nicerspro
niki-bot
nikto
nimblecrawler
ninja
nmap
nomad
noxtrumbot
npbot
nutch

Rule Path
Disallow /

obot
octopus
offline explorer
offline navigator
omniexplorer_bot
openbot
opendns domain crawler
openfind
openfind data gathere
openindexspider
openlinkprofiler
openvas
openvas
outclicksbot
outfoxbot
outfoxmelonbot

Rule Path
Disallow /

pageanalyzer
page analyzer
pagegrabber
page scorer
pagescorer
panscient
papa foto
pavuk
pcbrowser
pecl::http
peoplepal
perplexitybot
phantom-bot
phpcrawl
picscout
picsearch
picturefinder
pimonster
piplbot
pi-monster
pinterestbot
pixray
pleasecrawl
plumanalytics
pockey
poe-component-client-http
port huron labs
probethenet
program shareware
propowerbot
propowerbot/2.14
prowebwalker
psbot
pump
pxbroker
pycurl

Rule Path
Disallow /

quepasacreep
queryn metasearch
quick-crawler

Rule Path
Disallow /

rankactive
rankactivelinkbot
rankflex
rankingbot
rankingbot2
rankivabot
rankurbot
realdownload
reaper
rebelmouse
recorder
redesscrapy
reget
repomonkey
ripper
river valley inc
rma
rocketcrawler
rogerbot
rvldsmcmuwduwxnltyrm x snl

Rule Path
Disallow /

safetynet robot 0.1
salesintelligent
sbider
scanalert
scanbot
scan.lol
scooter
scooter-3.0.fs
scoutjet
scrapy
screaming
screenerbot
searchestate
searchmetricsbot
searchpreview
semrush
semrushbot
seokicks
seolyticscrawler
seomoz
seoprofiler
seoscanners
seostats
sexsearcher
seznambot
shodan
shopwiki
sightupbot
siphon
sistrix
sitebeam
sitebot
siteexplorer
siteimprove
sitelockspider
sitesnagger
sitesucker
site sucker
sitevigil
slackbot-linkexpanding
slysearch
smartdownload
smtbot
snake
snapbot
snooper/b97_01
snoopy
socialrankiobot
sogouspider
sogou web spider
sootle
sosospider
sottopop
spacebison
spammen
spankbot
spanner
spbot
speedy
speedy spider
spinn3r
sproose
sputnikbot
sqlmap
sqlworm
sqworm
steeler
stripper
sucker
sucuri
suggybot
sumeetbot
superbot
superbot/2.6
superhttp
surfbot
surveybot
suzuran
swiftbot
sysscan
szukacz
szukacz/1.4

Rule Path
Disallow /

t0phackteam
t8abot
takeout
teleport
teleportpro
telesoft
telesphoreo
telesphorep
the intraformant
thenomad
tighttwatbot
titan
toata
tocrawl/urldispatcher
toweyabot
tracemyfile
trackback
true_robot
true_robot/1.0
turingos
turnitin
turnitinbot
twengabot
twice
twiceler
typhoeus

Rule Path
Disallow /

unisterbot
urlpouls
urly.warning
urly warning
url_spider_pro

Rule Path
Disallow /

vacuum
vagabondo
vb project
vci
vci webviewer
vericitecrawler
vidiblescraper
virusdie
vivaldi
voideye
voil
voltron
voyager

Rule Path
Disallow /

walhello appie
wallpapers/3.0
wallpapershd
wasalive-bot
wbdbot
wbsearchbot
webalta
webauto
web auto
webbandit
webbandit/3.50
webcapture
webcollage
web collage
webcopier
webdav
webenhancer
web enhancer
webfetch
web fetch
webfuck
web fuck
webgo is
webimagecollector
webleacher
wells search ii
webmasterworldforumbot
webmeup-crawler
webmirror
webpix
web pix
webreaper
websauger
web sauger
webshag
websiteextractor
websitequester
website quester
webster
webster pro
webstripper
websucker
web sucker
webwhacker
webzip
web image collector
wep search 00
wesee
wget
whack
whacker
whatweb
who.is bot
widow
wikiofeedbot
wikiwix-bot
wikiwix-bot-3.0
winhttrack
wiseguys robot
wisenutbot
wonderbot
woobot
wotbox
wprecon
wpscan
www-collector-e
www-mechanize
www::mechanize
wwwoffle
wwwster
wysigot

Rule Path
Disallow /

x09mozilla
x22mozilla
xaldon_webspider
xaldon webspider
xenu
xenu's
xenu's link sleuth
xpymep1.exe

Rule Path
Disallow /

yacy
yacybot
yodaobot
youdaobot
yrspider

Rule Path
Disallow /

zade
zao-crawler
zauba
zauba.io
zermelo
zeus
zgrab
zitebot
zmeu
zookabot
zumbot
zyborg

Rule Path
Disallow /

*

Rule Path
Disallow /cgi-bin/
Disallow /tmp/
Disallow /cache/
Disallow /class/
Disallow /include/
Disallow /install/
Disallow /kernel/
Disallow /language/
Disallow /templates_c/
Disallow /modules/newbb/post.php
Disallow /modules/newbb/newtopic.php
Disallow /modules/newbb/search.php
Disallow /modules/wordbook/autocomplete.php
Disallow /modules/aquabdd/autocomplete.php
Disallow /search.php
Disallow /google.php
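
Unlike the named groups, the `*` group blocks only specific path prefixes, so any crawler not listed elsewhere may fetch everything else. The sketch below rewrites these rules in robots.txt syntax (assumed formatting) and tests a few URLs with Python's standard `urllib.robotparser`. The agent name `examplebot` and the path `/modules/newbb/viewtopic.php` are hypothetical and serve only to show an allowed case.

```python
from urllib.robotparser import RobotFileParser

# Rules of the "*" group as listed in the report (assumed formatting).
WILDCARD_RULES = """\
User-agent: *
Disallow: /cgi-bin/
Disallow: /tmp/
Disallow: /cache/
Disallow: /class/
Disallow: /include/
Disallow: /install/
Disallow: /kernel/
Disallow: /language/
Disallow: /templates_c/
Disallow: /modules/newbb/post.php
Disallow: /modules/newbb/newtopic.php
Disallow: /modules/newbb/search.php
Disallow: /modules/wordbook/autocomplete.php
Disallow: /modules/aquabdd/autocomplete.php
Disallow: /search.php
Disallow: /google.php
"""

BASE = "https://www.aquaportail.com"


def is_allowed(parser: RobotFileParser, path: str, agent: str = "examplebot") -> bool:
    """Check one path for a hypothetical agent that has no group of its own."""
    return parser.can_fetch(agent, BASE + path)


wildcard_parser = RobotFileParser()
wildcard_parser.parse(WILDCARD_RULES.splitlines())

# The last path is hypothetical; it is not taken from the report.
for path in ("/", "/search.php", "/cache/page.html", "/modules/newbb/viewtopic.php"):
    print(path, "allowed" if is_allowed(wildcard_parser, path) else "blocked")
```

Each `Disallow` value is treated as a prefix, so `/cache/` covers everything beneath that directory while the individual `.php` entries block only those scripts.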

Warnings

  • 6 invalid lines.