maformation.fr
robots.txt

Robots Exclusion Standard data for maformation.fr

Resource Scan

Scan Details

Site Domain maformation.fr
Base Domain maformation.fr
Scan Status Ok
Last Scan2024-05-25T00:38:44+00:00
Next Scan 2024-06-24T00:38:44+00:00

Last Scan

Scanned2024-05-25T00:38:44+00:00
URL https://maformation.fr/robots.txt
Redirect https://www.maformation.fr/robots.txt
Redirect Domain www.maformation.fr
Redirect Base maformation.fr
Domain IPs 217.70.184.55
Redirect IPs 51.103.19.211
Response IP 51.103.19.211
Found Yes
Hash 54f4c9dd3b57bfdb225da06c1867f2b466ac7ba462e45db297cbf2e8f117c647
SimHash 78f1d2b3c8e7

Groups

adsbot-google
adsbot-google-mobile

Rule Path
Allow /*?
Disallow

*

Rule Path
Allow /formation/
Allow /centres/
Allow /formations/metier_agent-d-escale.html?PageNumber=
Allow /formations/metier_developpeur-web.html?PageNumber=
Allow /formations/metier_windows.html?PageNumber=
Allow /formations/metier_webmaster.html?PageNumber=
Disallow /*?
Disallow */1000$
Disallow /contact
Disallow /confirmation
Disallow /completerdemande
Disallow /redirection
Disallow /rebond
Disallow /actualites/recherche
Disallow /alternance/recherche$
Disallow /alternance/recherche?
Disallow /centres/recherche
Disallow /centres/annuaire
Disallow /formation/recherche$
Disallow /formation/recherche?
Disallow /formationscentre
Disallow /diplomes/intitule
Disallow /diplomes/diplome
Allow /*utm_source%3D
Allow /*xtor%3D

aitcsrobot/1.1
alexibot
aqua_products
arachnophilia
aspider/0.09
asterias
atraxsolutions
auresys/1.0
b2w/0.1
backdoorbot
backdoorbot/1.0
backrub/.
bad bots
baiduspider-video
becomebot
big brother
bizbot003
bizbot04 kirk.overleaf.com
black hole
black.hole
blackwidow
blexbot
blowfish
blowfish/1.0
bookmark search tool
bot mailto:craftbot@yahoo.com
botalot
botrighthere
bspider/1.0 libwww-perl/0.40
builtbottough
bullseye
bullseye/1.0
bunnyslippers
cactvs chemistry spider
camontspider
ccbot
cegbfeieh
changedetection
checkbot/x.xx lwp/5.x
cheesebot
cherrypicker
cherrypickerelite/1.0
cherrypickerse/1.0
chinaclaw
cliqzbot
combine/0.0
conceptbot/0.3
copernic
copyrightcheck
cosmos
crescent
crescent internet toolpak http ole control v.1.0
custo
cyberpatrol sitecat webbot
cyberspyder/2.1
daumoa
deweb/1.01
diibot
disco
disco pump 3.0
disco pump 3.2
discobot
discofinder
dittospyder
doc
dotbot
download demon
download demon/3.2.0.8
download demon/3.5.0.11
download ninja
dumbot
ecatch
ecatch/3.0
eirgrabber
emailcollector
emailsiphon
emailwolf
enigmabot
enterprise_search
enterprise_search/1.0
erocrawler
es
exabot
explorersearch
express webpictures
express webpictures (www.express-soft.com)
extractorpro
eyenetie
fairad client
felixide/1.0
fetch
fido/0.9 harvest/1.4.pl2
fish-search-robot
flaming attackbot
flashget
flashget webwasher 3.2
foobot
freecrawl
freefind
frontpage
frontpage [nc,or]
gaisbot
gcreep/1.0
getright
getright/2.11
getright/3.1
getright/3.2
getright/3.3
getright/3.3.3
getright/3.3.4
getright/4.0.0
getright/4.1.0
getright/4.1.1
getright/4.1.2
getright/4.2
getright/4.2b (portuguxeas)
getright/4.2c
getright/4.3
getright/4.5
getright/4.5a
getright/4.5b
getright/4.5b1
getright/4.5b2
getright/4.5b3
getright/4.5b6
getright/4.5b7
getright/4.5c
getright/4.5d
getright/4.5e
getright/5.0beta1
getright/5.0beta2
geturl.rexx v1.05
getweb!
go!zilla
go!zilla (www.gozilla.com)
go!zilla 3.3 (www.gozilla.com)
go!zilla 3.5 (www.gozilla.com)
go-ahead-got-it
golem/1.1
grabnet
grafula
gromit/1.0
grub
grub-client
hmhkki/0.2
happyfunbot
harvest
harvest/1.5
hatena antenna
hazel's ferret web hopper
heritrix
hloader
hmview
httplib
httrack
httrack [nc,or]
httrack 3.0
huaweisymantecspider
humanlinks
image stripper
image sucker
inagist.com url crawler
incywincy/1.0b1
indy library
indy library [nc,or]
infonavirobot
informant
ingrid/0.1
interget
internet ninja
internet ninja 4.0
internet ninja 5.0
internet ninja 6.0
iron33/1.0.2
israelisearch/1.0
iti spider
jennybot
jetbot
jetbot/1.0
jetcar
joc web spider
jubiirobot
jumpstation
k2spider
katipo/1.0
kenjin spider
kenjin.spider
keyword density/0.9
keyword.density
kit-fireball/2.0 libwww/5.0a
labelgrab/1.1
larbin
larbin (samualt9@bigfoot.com)
larbin samualt9@bigfoot.com
larbin_2.6.2 (kabura@sushi.com)
larbin_2.6.2 (larbin2.6.2@unspecified.mail)
larbin_2.6.2 (listonatccdotgatechdotedu)
larbin_2.6.2 (vitalbox1@hotmail.com)
larbin_2.6.2 kabura@sushi.com
larbin_2.6.2 larbin@correa.org
larbin_2.6.2 larbin2.6.2@unspecified.mail
larbin_2.6.2 listonatccdotgatechdotedu
larbin_2.6.2 vitalbox1@hotmail.com
leechftp
lexibot
libweb/clshttp
libwww
linkextractorpro
linklooker
linko
linkscan/8.1a unix
linkscan/8.1a.unix
linkwalker
lnspiderguy
lwp-trivial
lwp-trivial/1.34
mass downloader
mass downloader/2.2
mata hari
mata.hari
mediafox/x.y
merzscope
metagopher
microsoft url control
microsoft url control - 5.01.4511
microsoft url control - 6.00.8169
microsoft.url
microsoft.url.control
midown tool
miixpc
miixpc/4.2
mister pix
mister pix ii 2.01
mister pix ii 2.02a
mister pix version.dll
mister.pix
moget
moget/2.1
momspider/1.00 libwww-perl/0.40
motor/0.2
mozilla/4.0 (compatible; bullseye; windows 95)
msiecrawler
naver
navroad
nearsite
neosciocrawler
net vampire
net vampire/3.0
netants
netants/1.10
netants/1.23
netants/1.24
netants/1.25
netcarta cyberpilot pro
netmechanic
netscoop/1.0 libwww/5.0a
netspider
netzip
netzip downloader 1.0 win32(nov 12 1998)
netzip-downloader/1.0.62 (win32; dec 7 1998)
netzippy+(http://www.innerprise.net/usp-spider.asp)
nhsewalker/3.0
nicerspro
nomad-v2.x
npbot
nutch
occam/1.0
octopus
offline explorer
offline explorer/1.2
offline explorer/1.4
offline explorer/1.6
offline explorer/1.7
offline explorer/1.9
offline explorer/2.0
offline explorer/2.1
offline explorer/2.3
offline explorer/2.4
offline explorer/2.5
offline navigator
offline.explorer
ogspider
open text site crawler v1.0
openbot
openfind
openfind data gathere
openfind data gatherer
oracle ultra search
pagegrabber
panscient.com
papa foto
pavuk
pcbrowser
perman
pgp-ka/1.2
propowerbot/2.14
prowebwalker
psbot
python-urllib
quepasacreep
queryn metasearch
queryn.metasearch
r6_commentreader
r6_feedfetcher
radiation retriever 1.1
realdownload
realdownload/4.0.0.40
realdownload/4.0.0.41
realdownload/4.0.0.42
reget
repomonkey
repomonkey bait & tackle/v1.01
resume robot
rma
roverbot
safetynet robot 0.1
sapphirewebcrawler
scoutjet
searchpreview
senrigan/xxxxxx
sitecheck.internetseer.com
sitesnagger
slysearch
smartdownload
smartdownload/1.2.76 (win32; apr 1 1999)
smartdownload/1.2.77 (win32; aug 17 1999)
smartdownload/1.2.77 (win32; feb 1 2000)
smartdownload/1.2.77 (win32; jun 19 2001)
snooper/b97_01
solbot/1.0 lwp/5.07
sootle
spankbot
spanner
spanner/1.0 (linux 2.0.27 i586)
spyder3.microsys.com
sqworm/2.9.85-beta (beta_release; 20011115-775; i686-pc-linux
stanford
stanford comp sci
superbot
superbot/3.0 (win32)
superbot/3.1 (win32)
superhttp
superhttp/1.0
surfbot
suzuran
szukacz/1.4
takeout
teleport
teleport pro
teleport pro/1.29
teleport pro/1.29.1590
teleport pro/1.29.1634
teleport pro/1.29.1718
teleport pro/1.29.1820
teleport pro/1.29.1847
teleportpro
telesoft
the intraformant
the.intraformant
thenomad
tighttwatbot
titan
tocrawl/urldispatcher
trendictionbot
true_robot
true_robot/1.0
turingos
turnitinbot
turnitinbot/1.5
twiceler
ubicrawler
ucsd-crawler
unisterbot
unwindfetchor/1.0
url control
url_spider_pro
urlck/1.2.3
urlspiderpro
urly warning
urly.warning
valkyrie/1.0 libwww-perl/0.40
vbseo
vci
vci webviewer vci webviewer win32
voideye
web image collector
web sucker
web.image.collector
webauto
webauto/3.40 (win98; i)
webbandit
webbandit/3.50
webcapture 2.0
webcopier
webcopier v.2.2
webcopier v2.5
webcopier v2.6
webcopier v2.7a
webcopier v2.8
webcopier v3.0
webcopier v3.0.1
webcopier v3.2
webcopier v3.2a
webcopy/
webcrawler/3.0 robot libwww/5.0a
webemailextrac.*
webenhancer
webferret
webfetch
webfetch/2.1.0
webfetcher/0.8,
webgo is
weblayers/0.0
webleacher
weblinker/0.0 libwww-perl/0.1
webmasterworld extractor
webmasterworldforumbot
webmoose/0.0.0000
webreaper
webreaper [info@webreaper.net]
webreaper [webreaper@otway.com]
webreaper v9.1 - www.otway.com/webreaper
webreaper v9.7 - www.webreaper.net
webreaper v9.8 - www.webreaper.net
webreaper vwebreaper v7.3 - www,otway.com/webreaper
webs@recruit.co.jp
websauger
websauger 1.20b
websauger 1.20j
websauger 1.20k
website extractor
website quester
website quester - www.asona.org
website quester - www.esalesbiz.com/extra/
website.quester
webster pro
webster.pro
webstripper
webstripper/2.03
webstripper/2.10
webstripper/2.12
webstripper/2.13
webstripper/2.15
webstripper/2.16
webstripper/2.19
webvac
webvac/1.0
webwalk
webwalker
webwalker/1.10
webwatch
webwhacker
webzip
webzip/2.75 (http://www.spidersoft.com)
webzip/3.65 (http://www.spidersoft.com)
webzip/3.80 (http://www.spidersoft.com)
webzip/4.0
webzip/4.0 (http://www.spidersoft.com)
webzip/4.1 (http://www.spidersoft.com)
webzip/4.21
webzip/4.21 (http://www.spidersoft.com)
webzip/5.0
webzip/5.0 (http://www.spidersoft.com)
webzip/5.0 pr1 (http://www.spidersoft.com)
wget
wget/1.4.0
wget/1.5.2
wget/1.5.3
wget/1.6
wget/1.7
wget/1.8
wget/1.8.1
wget/1.8.1+cvs
wget/1.8.2
wget/1.9-beta
whowhere robot
widow
wired-digital-newsbot/1.5
www collector
www.freeloader.com.
www-collector-e
wwwoffle
wwwwanderer v3.0
xaldon webspider
xaldon webspider 2.5.b3
xaldon_webspider
xenu
xenu's
xenu's link sleuth 1.1c
xget/0.7
yahoo pipes 1.0
yahoo pipes 2.0
yandex
yandexsomething
yasaklibot
yes
yesupbot
yeti
zao
zealbot
zeus
zeus 11389 webster pro v2.9 win32
zeus 11652 webster pro v2.9 win32
zeus 18018 webster pro v2.9 win32
zeus 26378 webster pro v2.9 win32
zeus 30747 webster pro v2.9 win32
zeus 32297 webster pro v2.9 win32
zeus 39206 webster pro v2.9 win32
zeus 41641 webster pro v2.9 win32
zeus 44238 webster pro v2.9 win32
zeus 51070 webster pro v2.9 win32
zeus 51674 webster pro v2.9 win32
zeus 51837 webster pro v2.9 win32
zeus 63567 webster pro v2.9 win32
zeus 6694 webster pro v2.9 win32
zeus 82016 webster pro v2.9 win32
zeus 82900 webster pro v2.9 win32
zeus 84842 webster pro v2.9 win32
zeus 90872 webster pro v2.9 win32
zeus 94934 webster pro v2.9 win32
zeus 95245 webster pro v2.9 win32
zeus 95351 webster pro v2.9 win32
zeus 97371 webster pro v2.9 win32
zeus link scout
zyborg

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.maformation.fr/sitemap.xml