warketingdigital.net
robots.txt

Robots Exclusion Standard data for warketingdigital.net

Resource Scan

Scan Details

Site Domain warketingdigital.net
Base Domain warketingdigital.net
Scan Status Ok
Last Scan2024-09-23T19:47:49+00:00
Next Scan 2024-09-30T19:47:49+00:00

Last Scan

Scanned2024-09-23T19:47:49+00:00
URL https://warketingdigital.net/robots.txt
Redirect https://www.warketingdigital.net/robots.txt
Redirect Domain www.warketingdigital.net
Redirect Base warketingdigital.net
Domain IPs 104.21.71.11, 172.67.168.249, 2606:4700:3032::ac43:a8f9, 2606:4700:3035::6815:470b
Redirect IPs 104.21.71.11, 172.67.168.249, 2606:4700:3032::ac43:a8f9, 2606:4700:3035::6815:470b
Response IP 172.67.168.249
Found Yes
Hash a19fae0090e972c640eb6871944d997a4e23ff701174539ef5dba6a21803cd96
SimHash 6f48a4b223b7

Groups

googlebot
bingbot
baidu
slurp
yandexbot
duckduckbot

Rule Path
Allow /

*

Rule Path
Disallow /wp-admin/
Disallow /wp-includes/
Disallow /wp-login.php
Disallow /author/
Disallow /search
Disallow /category/*/*
Disallow */trackback
Disallow /feed/
Disallow /*/feed/
Disallow /comments
Disallow /*/comments
Disallow /cgi-bin/
Disallow /*.cgi$
Disallow /*.gz$
Disallow /*.inc$
Disallow /*.php$
Disallow /*.swf$
Disallow /*.wmv$
Disallow /*.xhtml$
Disallow /*?PageSpeed=noscript
Disallow /*?swcfpc=1
Disallow /*?fbclid=

feedburner
feedly

Rule Path
Allow /feed/

admantx
aibot
alittle client
aspseek
abonti
aboundex
aboundexbot
acunetix
adstxtcrawlertp
afd-verbotsverfahren
aihitbot
aipbot
alexibot
allsubmitter
alligator
alphabot
anarchie
anarchy
anarchy99
ankit
anthill
apexoo
aspiegel
asterias
atomseobot
attach
awariorssbot
awariosmartbot
bbbike
bdcbot
bdfetch
blexbot
backdoorbot
backstreet
backweb
backlink-ceck
backlinkcrawler
badass
bandit
barkrowler
batchftp
battleztar bazinga
betabot
bigfoot
bitacle
blackwidow
black hole
blackboard
blow
blowfish
boardreader
bolt
botalot
brandprotect
brandwatch
buck
buddy
builtbottough
builtwith
bullseye
bunnyslippers
buzzsumo
bytespider
catexplorador
ccbot
code87
cshttp
calculon
cazoodlebot
cegbfeieh
censysinspect
cheteam
cheesebot
cherrypicker
chinaclaw
chlooe
citoid
claritybot
cliqzbot
cloud mapping
cocolyzebot
cogentbot
collector
copier
copyrightcheck
copyscape
cosmos
craftbot
crawling at home project
crazywebcrawler
crescent
crunchbot
curious
custo
cyotekwebcopy
dblbot
diibot
dsearch
dts agent
datacha0s
databasedrivermysqli
demon
deusu
devil
digincore
digitalpebble
dirbuster
disco
discobot
discoverybot
dispatch
dittospyder
dnbcrawler-analytics
dnyzbot
domcopbot
domainappender
domaincrawler
domainsigmacrawler
domainstatsbot
domains project
dotbot
download wonder
dragonfly
drip
eccp/1.0
email siphon
email wolf
easydl
ebingbong
ecxi
eirgrabber
erocrawler
evil
exabot
express webpictures
extlinksbot
extractor
extractorpro
extreme picture finder
eyenetie
ezooms
fdm
fhscan
femtosearchbot
fimap
firefox/7.0
flashget
flunky
foobot
freeuploader
frontpage
fuzz
fyberspider
g-i-g-a-b-o-t
gptbot
gt::www
galaxybot
genieo
germcrawler
getright
getweb
getintent
gigabot
go!zilla
go-ahead-got-it
gozilla
gotit
grabnet
grabber
grafula
grapefx
grapeshotcrawler
gridbot
headmasterseo
hmview
htmlparser
http::lite
httrack
haansoft
haosouspider
harvest
havij
heritrix
hloader
honolulubot
humanlinks
hybridbot
idbte4m
idbot
irlbot
iblog
id-search
ilsebot
image fetch
image sucker
indeedbot
indy library
infonavirobot
infotekies
intelliseek
interget
internetseer
internet ninja
iria
iskanie
istellabot
joc web spider
jamesbot
jbrofuzz
jennybot
jetcar
jetty
jikespider
joomla
jorgee
justview
jyxobot
kenjin spider
keybot translation-search-machine
keyword density
kinza
kozmosbot
lnspiderguy
lwp::simple
lanshanbot
larbin
leap
leechftp
leechget
lexibot
lftp
libweb
libwhisker
liebaofast
lightspeedsystems
likse
linkscan
linkwalker
linkbot
linkextractorpro
linkpadbot
linksmanager
linqiametadatadownloaderbot
linqiarssbot
linqiascrapebot
lipperhey
lipperhey spider
litemage_walker
lmspider
ltx71
mfc_tear_sample
mgfbot
mgtbot
mj12bot
mj12bot
msobot
mspasbot
magpie
mabot
mabot
mantisbot
maxbot
megabot
megaspider
metalink
metrobot
microsearch
micromat
mixbot
moby
mobitop
mojeekbot
monsterbot
moonbot
morebot
msearch
mspyder
mungbot
mybot
mycrawler
myspider
nbi
netcrawler
netindex
netmechanic
netsilk
netspider
nfbot
nginx
nutch
nymia
obot
octoparse
omega
online links
openweb spider
oracle
origincrawler
osfbot
outpost
pagebot
pagecrawler
pagegrader
pageripper
panda
pangu
parsbot
parsers
pbot
phantasma
picviz
pingdom
planetary
plugg
powerbot
proweb
proxe
pulsar
qibot
qool
quester
quickbot
quickspider
r2d2
r8bot
rblbot
raptor
rascal
reget
reptar
residualbot
retrobot
robo
robotchecker
robojob
robozilla
romen
rssbot
rssfetch
s-meta
s2b
s4bot
sabre
sambot
sano
scorpion
scour
screaming frog
screener
scrupulous
seeker
semrushbot
senatbot
serverbot
shalla
silktide
simplebot
simplespider
simpleweb
sitebot
sitecheck
sitemap
sitereader
sitespider
siteware
sitespy
skynet
skyscraper
sleuth
smartbot
snagbot
snowball
snuffleupagus
socialspider
soft2web
spambot
spiderbot
spidr
splurge
spooky
spider-v
spyder
sqbot
sqweb
sucker
sugrubot
sumobot
superspider
surfbot
susie
swiftnet
sybose
system
t3bot
tangle
tbot
teebot
terabot
the spider
tiddler
tiki
tineye
tjbot
tokkyo
topbot
topweb
trackerbot
translit
turingbot
tweepy
twitbot
ubot
uboot
ubot
ultrabot
unspider
upbot
upcrawler
uplink
urlfuzzer
usabilla
uptimerobot
uxbot
vbot
vspider
vault
vintner
vision
viskit
vortex
vscrawler
w3c
w4bot
w4spider
wbot
wc3bot
web auto
web extractor
web funnel
web grab
web reader
web scroller
web walker
webaider
webscraper
websucker
websurfer
webworker
webwanderer
weco
webroamer
webzilla
wibot
wilbot
wmspider
x-bot
x-bot 2.0
xabber
xcra
xem
xenobot
xovo
ybot
yandex
yandexbot
yandeximages
yisouspider
yuabot
yxbot
z-bot
zbtbot
zephyr
zigzag
zobot
zoie
zoid
zsbot

No rules defined. All paths allowed.

Other Records

Field Value
sitemap https://www.warketingdigital.net/sitemap_index.xml
sitemap https://www.warketingdigital.net/news-sitemap.xml

Comments

  • Indiquer les sitemaps
  • Autoriser les moteurs sur les parties utiles
  • Bloquer les fichiers et répertoires sensibles
  • Autoriser l'accès RSS pour certains bots
  • Bloquer les bots indésirables

Warnings

  • 5 invalid lines.