Robots Exclusion Standard data for meteomincio.it
Resource Scan
Scan Details
Site Domain | meteomincio.it |
Base Domain | meteomincio.it |
Scan Status | Ok |
Last Scan | 2024-10-18T18:18:06+00:00 |
Next Scan | 2024-11-17T18:18:06+00:00 |
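The two timestamps above are exactly 30 days apart, which suggests a fixed rescan interval. The report does not state the interval, so treating it as a 30-day schedule is an inference from these two values only. A minimal check of the arithmetic:

```python
from datetime import datetime, timedelta

# Timestamps copied from the Scan Details table.
last_scan = datetime.fromisoformat("2024-10-18T18:18:06+00:00")
next_scan = datetime.fromisoformat("2024-11-17T18:18:06+00:00")

# Difference between the two scans as a timedelta.
interval = next_scan - last_scan
print(interval.days)
```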
Last Scan
Scanned | 2024-10-18T18:18:06+00:00 |
URL | https://meteomincio.it/robots.txt |
Redirect | https://www.meteomincio.it/robots.txt |
Redirect Domain | www.meteomincio.it |
Redirect Base | meteomincio.it |
Domain IPs | 89.46.108.57 |
Redirect IPs | 89.46.108.57 |
Response IP | 89.46.108.57 |
Found | Yes |
Hash | 0c6cccc5ac2c26d43e4a19f7146f4972174f0bd8650ff829a21033ff73c83375 |
SimHash | 72d11332e6b0 |
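The Hash value is 64 hexadecimal digits long, which is consistent with a SHA-256 digest of the fetched file. The report does not name the algorithm, so SHA-256 is an assumption here, and the sample body below is hypothetical and will not reproduce the digest shown. A sketch of how such a fingerprint can be used to detect changes between scans:

```python
import hashlib


def robots_digest(body: bytes) -> str:
    """Return a hex digest of the raw robots.txt bytes (SHA-256 assumed)."""
    return hashlib.sha256(body).hexdigest()


# Hypothetical file contents, used only to illustrate the comparison.
previous = b"User-agent: *\nCrawl-delay: 2\n"
current = b"User-agent: *\nCrawl-delay: 5\n"

# Any byte-level change in the file produces a different digest.
changed = robots_digest(previous) != robots_digest(current)
print(changed)
```

An exact hash only signals that something changed; the SimHash field presumably serves as a similarity fingerprint for near-duplicate detection, but its construction is not described in the report.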
Groups
*
No rules defined. All paths allowed.
Other Records
Field | Value |
---|---|
crawl-delay | 2 |
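For crawlers that fall under the `*` group, the record above means every path may be fetched, with a requested pause of 2 seconds between requests. The sketch below feeds an equivalent two-line file to Python's standard `urllib.robotparser`; the raw robots.txt is not shown in this report, so the lines are reconstructed from the table, and `examplebot` is a hypothetical user-agent name.

```python
from urllib.robotparser import RobotFileParser

# Reconstructed from the report: a "*" group with no rules and a crawl-delay.
lines = [
    "User-agent: *",
    "Crawl-delay: 2",
]

parser = RobotFileParser()
parser.parse(lines)

# "examplebot" is hypothetical; it matches no named group, so "*" applies.
delay = parser.crawl_delay("examplebot")
allowed = parser.can_fetch("examplebot", "https://www.meteomincio.it/any/path")
print(delay, allowed)
```

Crawl-delay is a non-standard field, so whether a given crawler honors it depends on the crawler.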
google-inspectiontool
googlebot
googleother
bingbot
applebot
duckduckbot
facebot
exabot-thumbnails
exabot
swiftbot
slurp
ccbot/2.0 (https://commoncrawl.org/faq/)
ccbot/2.0 (http://commoncrawl.org/faq/)
ccbot/2.0
Rule | Path |
---|---|
Allow | / |
abonti
abonti/0.92
abot v1.0
aboutthedomain
add catalog
add catalog/2.1
advbot
advbot/2.0
ahrefsbot
ahrefs-bot
ahrefsbot/1.0
ahrefs-bot/1.0
ahrefs-bot/2.0
ahrefs-bot/3.0
ahrefs-bot/4.0
ahrefs-bot/5.0
aihitbot
aihitbot/2.9
anonymous/0.0
arachnida
associative spider
baiduspider
baidu spider
battleztar bazinga
battleztar bazinga/0.01
bdfetch
betabot
bieshu
bigli seo
blackboard safeassign
black hole
blazer 1.0
blexbot
blexbot/1.0
blp_bbot
blp_bbot/0.1
boia-accessibility-agent/pr 1.0
bot for jce
bot/0.1 (bot for jce)
bpimagewalker
bpimagewalker/2.0
bubing
buibui-bot
buibui-bot/1.0
ca-crawler
ca-crawler/1.0
cakephp
calypso v/0.01
calypso
cb/nutch-1.7
ccbot
ccbot/2.0
checkbot
checkgzipcompression.com
chushou
cloudservermarketspider
cloudservermarketspider/1.0
clushbot/3.x-binaryfury
cms crawler
cms crawler: http://www.cmscrawler.com
coccoc
coincornerbot
coincornerbot/1.1
copyscape
crawler4j
crazywebcrawler 0.9.0
crazywebcrawler 0.9.1
crazywebcrawler 0.9.7
crazywebcrawler
crazywebcrawler-spider
crowsnest
crowsnest/0.5
curious george - www.analyticsseo.com/crawler
curious george
cuwhois
cuwhois/1.0
dahoms
datagnionbot
deusu/5.0.2
digincore
digincore bot
dispatch/0.11.0
domain re-animator bot
domainappender /1.0
domainappender
domaincrawler/3.0
domainsigmacrawler
domainsigmacrawler/0.1
domnutch
domnutch-bot
domnutch-bot/nutch
domnutch-bot/nutch-1.0
dotbot
eccp/1.2.1
ecommercebot
enlle punto com/nutch-1.9
episerver link checker
euripbot
euripbot/2.0
evc/2.0
evc-batch
evc-batch/2.0
express webpictures
faraday v0.8.8
faraday
findxbot
findxbot/1.0
flamingo_searchengine
flipboard robot
getproxi.es-bot
getproxi.es-bot/1.1
gigablastopensource
gigablastopensource/1.0
girafabot
gluten free crawler
gluten free crawler/1.0
gptbot
griffinbot
grifinbot/0.01
gwpimages
gwpimages/1.0
haiula
haiula/1.4
haosouspider
hivemind
hostharvest
hostharvest/0.4.28
hrcrawler
hrcrawler/2.0
http://git.io/tl_s2w
http://www.checkprivacy.or.kr:6600/rs/privacy_enfaq.jsp
hubspot links crawler 1.0
hubspot webcrawler
hubspot
hunchan
hypercrawl
hypercrawl/0.2
icap-iod
icc-crawler
icc-crawler/2.0
ichiro robot
image.coccoc/1.0
image2play
image2play/0.1
indy library
insightscollector
insightscollector/0.1
insightscollector/0.1beta
integrity/5
internaetboten
internaetboten/0.99
irl crawler
james bot - webcrawler
james bot
jamesbot
jetbrains 5.0
jetbrains
kraken
kraken/0.1
kyoto-tohoku-crawler/v1
larbin
lechenie
libwww-perl
link checker
link/1.0
linkcheck
linkcheckv3.0
linkdex
linkdex.com/v2.0
linkdex.com/v2.1
linkdexbot
linkdexbot/2.0
linkdexbot/2.1
linkdexbot-mobile/2.1
linkpadbot
linkpadbot/1.06
linqiascrapebot
linqiascrapebot/1.0
lipperhey seo service
lipperhey
lipperhey-kaus-australis
lipperhey-kaus-australis/5.0
listicka
lssrocketcrawler
lssrocketcrawler/1.0 lightspeedsystems
lssrocketcrawler/1.0
ltx71
lwnutch/nutch-1.4
mail.ru
mail.ru_bot
mail.ru_bot/2.0
mail.ru_bot/fast/2.0
md5sum
md5sum\x22
meanpathbot
megaindex.ru
megaindex.ru/2.0
mezhpozvonochnoi
mike-crawler
mixbot
mixrankbot
mj12bot
monkeybot/0.1
my crawler
my nutch spider/nutch-1.9
mycrowl/nutch-1.9
mygreatua/2.0
myiptest
nameprotect robot
nerdybot
netcraft spider
netestate ne crawler
netlyzer fastprobe
netresearchserver
netresearchserver/4.0
nmap scripting engine
node.io
node.js
node/simplecrawler 0.5.2
node/simplecrawler
obot/2.3.1
omgilibot
omgilibot/0.4
online domain tools - online website link checker
online domain tools - online website link checker/1.2
openfind robot
openhosebot
openhosebot/2.1
openstat
openstat/0.1
optimizationcrawler
optimizationcrawler/0.2
page analyzer v4.0
page analyzer
pageanalyzer
pageanalyzer/1.1
pageanalyzer/1.5
pagesinventory
pagespeed/1.1 fetcher
pagespeed/1.1
pagespeedbot
perl lwp
phpcrawl
phpsitecheck 1.0
phpsitecheck
plukkie
pogs/2.0
powermarks
powerpivot
privacy_enfaq.jsp
prlog
prlog/1.0
publiclibraryarchive.org
publiclibraryarchive.org/1.0
pu_in crawler
putin
putin spider
qingdao
qlikview
quipu
quipu/1.0
quipu/2.0
r6_commentreader
r6_feedfetcher
riddler
rivalseek.com-bot
rogerbot
rogerbot/1.0
rootlink
ru_bot/2.0
scopia
scopia crawler
scopia crawler 1.0
scopia crawler 1.1
scopia crawler 1.2
scrapy
scrapy/0.16.5
scrapy/0.24.4
scrapy/0.24.5
scrapy/0.24.6
scrapy/1.0.1
screaming frog seo spider
screaming frog seo spider/2,55
screaming frog seo spider/2.55
screaming frog seo spider/3.1
screaming frog seo spider/3.3
screaming frog seo spider/4.1
screaming frog seo spider/5.0
screaming frog seo spider/5.1
screaming frog seo spider/5.1 beta 2
scrutiny/4
semrushbot
semrushbot-sa
seodiver/1.0
seokicks
seokicks-robot
seolyticscrawler
seolyticscrawler/3.0
seoscanners
seoscanners.net/1
seostats 2.1.0
seosys/nutch-2.3
setcronjob/1.0
seznambot
sheerboredom.experimental.robot
sheerboredom.experimental.robot/0.2
showyoubot
simplecrawler
sistrix crawler
sistrix
sitebot
sitebot/0.1
siteexplorer
siteexplorer/1.0
siteexplorer/1.0b
siteluxbot
siteluxbot/1.0
skimbot
skimbot/1.0
sky nutch crawler/nutch-1.9
smtbot
smtbot/1.0
snk screenshot bot
snk screenshot bot/0.20
sogou spider
sogou web spider
spambayes
spambayes/1.1a3+
spbot
spbot/4.4.2
spiderbot
spiderling
spiderbot/nutch-1.7
spray-can
spray-can/1.2.1
ssg/3.0
statastico
statastico/4.0
steeler
steeler/3.5
stratagems kumo
stratagems
studiofaca search
studiofaca
sukibot
sukibot_heritrix
sukibot_heritrix/3.1.1
superbot
superbot/2.6
surveybot
synapse
synthesio crawler release monalisa
tbot-nutch/nutch-1.10
traackr.com bot
trendictionbot
trendiction-bot
truebot
truebot/1.0
tulipchain/5.xx
twmbot/0.1
typhoeus
ucmore crawler app
umbot-ln
umbot-ln/1.0
updown_tester
urlchecker
v1.0/1.2
w3af.org
wasalive
wasalive-bot
vbseo
wbsearchbot
wbsearchbot/1.1
wearenotevil
webalta
webalta crawler
web corpus crawler
webcookies
webcookies/1.0
webcopier vx.xa
webnest 0.9
webql
webreaper
webscout
webscout/1.0
web-sniffer
web-sniffer/1.1.0
website extractor
webster pro v3.4
webtarantula.com crawler
wecrawlforthepeace
winhttrack
vegebot
vegi bot
welikelinks
vericitecrawler
vericitecrawler/nutch-1.9
whatweb
whatweb/0.4.8-dev
visited by http://tools.geek-tools.org
voila robot
voltron
woobot
woobot/1.1
woobot/2.0
vorboss web crawler
vorboss web crawler/nutch-2.3
worldbrewbot
worldbrewbot/2.1
worldwebheritage.org
worldwebheritage.org/1.0
wscheck.com
wscheck.com/1.0.0
www.deadlinkchecker.com
www.petitsage.fr site detector 0.4
www-mechanize
www-mechanize/1.74
xenu link sleuth
xenu's link sleuth
xovibot
xovibot/2.0
xspider
yandex robot
yandex
yetibot
yisouspider
yoozbot
yoozbot-2.2
zgrab/0.x
zzabmbot
zzabmbot/1.0
titan
netmechanic
cherrypicker
emailcollector
disco pump 3.1
netattache
netattache light 1.1
emailsiphon
webbandit
emailwolf
extractorpro
copyrightcheck
crescent
sitesnagger
prowebwalker
cheesebot
teleport
wget
miixpc
telesoft
website quester
webzip
moget/2.1
webzip/4.0
webstripper
webstripper/2.02
webstripper/2.59
websauger
webcopier
netants
mister pix
webauto
thenomad
www-collector-e
rma
libweb/clshttp
asterias
httplib
turingos
spanner
infonavirobot
harvest/1.5
bullseye/1.0
crescent internet toolpak http ole control v.1.0
cherrypickerse/1.0
cherrypickerelite/1.0
webbandit/3.50
nicerspro
microsoft url control - 5.01.4511
dittospyder
foobot
webmasterworldforumbot
spankbot
botalot
lwp-trivial/1.34
lwp-trivial
wget/1.6
bunnyslippers
microsoft url control - 6.00.8169
urly warning
wget/1.5.3
linkwalker
cosmos
moget
hloader
humanlinks
linkextractorpro
mata hari
lexibot
offline explorer
web image collector
the intraformant
true_robot/1.0
true_robot
blowfish/1.0
jennybot
miixpc/4.2
builtbottough
propowerbot/2.14
backdoorbot/1.0
tocrawl/urldispatcher
webenhancer
tighttwatbot
suzuran
vci webviewer vci webviewer win32
vci
szukacz/1.4
queryn metasearch
openfind data gathere
openfind
xenu's link sleuth 1.1c
xenu's
zeus
repomonkey bait & tackle/v1.01
repomonkey
zeus 32297 webster pro v2.9 win32
webster pro
erocrawler
linkscan/8.1a unix
kenjin spider
cegbfeieh
msproxy/2.0
Rule | Path |
---|---|
Disallow | / |
Disallow | *.js$ |
Disallow | *.jpg$ |
Disallow | *.png$ |
Disallow | *.css$ |
Disallow | *.gif$ |
Disallow | /cache/ |
Disallow | /files/ |
Disallow | /js/ |
Disallow | /blackhole/ |
Disallow | /WU-History/ |
Disallow | /wxwuhistory.php |
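Several of the rules above use the `*` wildcard and the `$` end-of-path anchor defined in RFC 9309. Python's `urllib.robotparser` does not interpret these, so the sketch below uses a small hand-written matcher instead; the function names are hypothetical, and it only illustrates the pattern syntax, not how any listed crawler actually behaves. Because this group also contains `Disallow: /`, every path is already blocked for these agents, which makes the remaining patterns redundant for them.

```python
import re
from typing import Iterable


def pattern_to_regex(pattern: str):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal pieces and join them with '.*' where '*' appeared.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + (r"\Z" if anchored else ""))


def is_disallowed(path: str, patterns: Iterable[str]) -> bool:
    """Return True if any pattern matches from the start of the path."""
    return any(pattern_to_regex(p).match(path) for p in patterns)


# Subset of the rules above, leaving out "/" to show the wildcard patterns.
patterns = ["*.js$", "*.jpg$", "/cache/", "/wxwuhistory.php"]
print(is_disallowed("/js/app.js", patterns))
print(is_disallowed("/app.js?v=1", patterns))
print(is_disallowed("/index.php", patterns))
```

The `$` anchor explains why `/app.js?v=1` is not matched by `*.js$`: the path no longer ends in `.js` once a query string is appended.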
Other Records
Field | Value |
---|---|
sitemap | http://www.meteomincio.it/sitemap.xml |
Warnings
- 5 invalid lines.