sanomoon.com
robots.txt

Robots Exclusion Standard data for sanomoon.com

Resource Scan

Scan Details

Site Domain sanomoon.com
Base Domain sanomoon.com
Scan Status Ok
Last Scan 2024-09-13T13:38:19+00:00
Next Scan 2024-10-13T13:38:19+00:00

Last Scan

Scanned 2024-09-13T13:38:19+00:00
URL http://sanomoon.com/robots.txt
Domain IPs 114.29.236.243
Response IP 114.29.236.243
Found Yes
Hash b9fa9db2bc9c10bb7231d1852239d18c736969fa64c49401b9d0609825cbb6dc
SimHash 4cf59731ff57

Groups

acme-spider
bbot
openfind
webs
sygol
urlck
kdd
meshexplorer
patric
gama
w3index
ahoythehomepagefinder
arachnophilia
araneo
architext
aretha
aspider
backrub
blackwidow
cactvschemistryspider
checkbot
churl
core
deweb
eit
emacs
emcspider
ferret
finnish
fish
francoroute
funnelweb
getbot
geturl
harvest
hi
htdig
htmlgobble
hyperdecontextualizer
ibm
incywincy
infoseek
infoseeksidewinder
intelliagent
israelisearch
jobot
joebot
jubii
jumpstation
katipo
lycos
macworm
momspider
monster
netcarta
nhse
nomad
northstar
nzexplorer
octopus
perignator
phantom
pioneer
pitkow
pka
python
rbse
resumerobot
roverbot
safetynetrobot
senrigan
sgscout
sitetech
spry
tarspider
tcl
titan
tkwww
ucsd
visionsearch
w3m2
wanderer
webcopy
webfetcher
webfoot
weblayers
weblinker
webmirror
websnarf
webvac
webwalk
webwatch
wmir
wombat
worm
felix
inspectorwww
netmechanic
abcdatos
alkaline
appie
arale
araybot
ariadne
askjeeves
atn
atomz
auresys
bayspider
bigbrother
bjaaland
blindekuh
borg-bot
boxseabot
brightnet
bspider
calif
cassandra
cgireader
christcrawler
cienciaficcion
cmc
combine
confuzzledbot
coolbot
cosmos
cruiser
cusco
cyberspyder
cydralspider
desertrealm
dienstspider
digger
diibot
directhit
dnabot
download_express
dragonbot
dwcp
e-collector
elfinbot
esculapio
esther
evliyacelebi
fastcrawler
fetchrover
fido
fireball
fouineur
freecrawl
gazz
gcreep
golem
googlebot
grapnel
gromit
gulliver
gulperbot
hambot
havindex
hometown
iajabot
iconoclast
ilse
imagelock
informant
infospider
irobot
javabee
jbot
jcrawler
jobo
kapsi
ko_yappo_robot
labelgrabber.txt
larbin
legs
linkidator
linkwalker
magpie
marvin
mattie
mediafox
mindcrawler
mnogosearch
motor
msnbot
muncher
muninn
muscatferret
mwdsearch
myweb
ndspider
nederland.zoek
netscoop
newscan-online
objectssearch
occam
orb_search
packrat
parasite
pegasus
perlcrawler
phpdig
piltdownman
pimptrain
pjspider
plumtreewebaccessor
poppi
portalb
psbot
puu
raven
rhcs
rixbot
roadrunner
robbie
robi
robocrawl
robofox
robozilla
rules
scooter
search_au
search-info
searchprocess
shaihulud
sift
simbot
site-valet
skymob
slurp
smartspider
snooper
solbot
spider_monkey
spiderbot
spiderline
spiderview
ssearcher
suke
suntek
sven
tach_bw
techbot
templeton
titin
tlspider
udmsearch
uptimebot
us
valkyrie
verticrawl
victoria
voidbot
voyager
vwbot
wallpaper
wapspider
webcatcher
webinator
webmoose
webreader
webreaper
webspider
webwalker
wget
whatuseek
whowhere
wired-digital
wlm
wolp
wwwc
wz101
xget
anthill
arks
bloodhound
collective
ebiness
fdse
griffon
iron33
kilroy
linkscan
lockon
logo_gif
merzscope
moget
ontospider
pageboy
shaggy
slcrawler
speedy
spiderman
tarantula
webbandit
webquest
googlebot
slurp
msnbot

Rule Path
Allow /
Disallow /cgi-bin/
Disallow /player/
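
The three rules above can be read under the longest-match precedence described in RFC 9309: the most specific (longest) matching path wins, and Allow wins a tie. The following is a minimal sketch of that evaluation, not the scanner's own logic. The `RULES` list and the `is_allowed` helper are hypothetical names introduced for illustration, and the sketch ignores wildcards (`*`, `$`) and percent-encoding, neither of which appears in the rules listed here.

```python
# Rules copied from the report; the tuple layout is an assumption for this sketch.
RULES = [
    ("allow", "/"),
    ("disallow", "/cgi-bin/"),
    ("disallow", "/player/"),
]


def is_allowed(path, rules=RULES):
    """Return True if `path` may be crawled under longest-match precedence."""
    best_len = -1
    best_allow = True  # a path matched by no rule is allowed by default
    for kind, prefix in rules:
        if path.startswith(prefix):
            allow = kind == "allow"
            # A longer matching prefix wins; on equal length, allow wins.
            if len(prefix) > best_len or (len(prefix) == best_len and allow):
                best_len = len(prefix)
                best_allow = allow
    return best_allow


if __name__ == "__main__":
    for p in ["/", "/index.html", "/cgi-bin/test.cgi", "/player/", "/player"]:
        print(p, is_allowed(p))
```

Note that `/player` (no trailing slash) is not covered by `Disallow /player/`, so only paths beneath that directory are blocked. Parsers that apply rules in file order instead of by longest match may treat the leading `Allow /` as matching everything, so results can differ between crawlers.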

Other Records

Field Value
crawl-delay 10
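
The `crawl-delay` field is not part of RFC 9309; crawlers that honor it commonly interpret the value as a number of seconds to wait between requests. The sketch below shows one way a client might read it with Python's standard `urllib.robotparser`. The robots.txt lines are a reconstruction and an assumption: the report does not show the original file layout, so a single representative user agent (`googlebot`) stands in for the full group, and `crawl-delay` is assumed to belong to that group. The `seconds_to_wait` helper is a hypothetical name.

```python
from urllib.robotparser import RobotFileParser

# Assumed reconstruction of one group from the report, not the original file.
ROBOTS_LINES = [
    "User-agent: googlebot",
    "Allow: /",
    "Disallow: /cgi-bin/",
    "Disallow: /player/",
    "Crawl-delay: 10",
]

parser = RobotFileParser()
parser.parse(ROBOTS_LINES)

# crawl_delay() returns the delay for a matching group, or None if absent.
delay = parser.crawl_delay("googlebot")


def seconds_to_wait(last_fetch, now, delay):
    """Return how long to sleep so that requests are at least `delay` apart."""
    return max(0.0, delay - (now - last_fetch))


if __name__ == "__main__":
    print("crawl delay:", delay)
    # 4 seconds after the previous fetch, 6 more seconds remain.
    print("wait:", seconds_to_wait(last_fetch=100.0, now=104.0, delay=delay))
```

A polite crawler would call something like `time.sleep(seconds_to_wait(...))` before each request to this host.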