Robots Exclusion Standard data for unimarconi.it

Resource Scan

Scan Details

Site Domain unimarconi.it
Base Domain unimarconi.it
Scan Status Ok
Last Scan 2024-10-21T23:15:55+00:00
Next Scan 2024-11-20T23:15:55+00:00

Last Scan

Scanned 2024-10-21T23:15:55+00:00
URL https://www.unimarconi.it/robots.txt
Domain IPs 156.54.165.80
Response IP 156.54.165.80
Found Yes
Hash c0423419e52b98ff5e8bace3339ebca7f0992c6c4a22080d06779eefeb5dd0ec
SimHash c7fdf213caa7

Groups

*

Rule Path
Allow

a1 sitemap generator
abachobot
abcdatos botlink
aboundexbot
aboutusbot
accoona-ai-agent
addsugarspiderbot
adidxbot
ahoy! the homepage finder
ahrefsbot
aitcsrobot/1.1
alexibot
amznkassocbot
aqua_products
arachnophilia
architextspider
aspider/0.09
asterias
auresys/1.0
awariorssbot
awariosmartbot
b2w/0.1
backdoorbot/1.0
backrub/.
baiduspider
baiduspider-image
baiduspider-news
baiduspider-video
becomebot
beslistbot
big brother
bizbot003
black hole
blackwidow
blexbot
blowfish
blowfish/1.0
bookmark search tool
botalot
botrighthere
builtbottough
bullseye/1.0
bunnyslippers
cactvs chemistry spider
catchbot
ccbot
cegbfeieh
checkbot/x.xx lwp/5.x
cheesebot
cherrypicker
cherrypickerelite/1.0
cherrypickerse/1.0
chinaclaw
coccoc
combine/0.0
conceptbot/0.3
copernic
copyrightcheck
cosmos
crescent
crescent internet toolpak http ole control v.1.0
custo
cyberpatrol sitecat webbot
cyberspyder/2.1
daumoa
deweb/1.01
disco
disco pump 3.0
disco pump 3.2
discofinder
dittospyder
dotbot
download demon
download demon/3.2.0.8
download demon/3.5.0.11
dumbot
ecatch
ecatch/3.0
eirgrabber
emailcollector
emailsiphon
emailwolf
enigmabot
envolk
erocrawler
exabot
explorersearch
express webpictures
extractorpro
eyenetie
fairad client
fdse robot
felixide/1.0
fido/0.9 harvest/1.4.pl2
fish-search-robot
flaming attackbot
flashget
flashget webwasher 3.2
foobot
freecrawl
frontpage
gaisbot
genieo
getright/4.2
getweb!
gigabot
girafabot
go!zilla
golem/1.1
grabnet
grafula
grapeshot
gromit/1.0
grub
grub-client
gsa-crawler
happyfunbot
harvest/1.5
hatena antenna
hazel's ferret web hopper
hloader
hmview
httplib
httrack
huaweisymantecspider
humanlinks
ia_archiver
image stripper
image sucker
inagist.com url crawler
incywincy/1.0b1
indy library
infonavirobot
informant
ingrid/0.1
interget
internet ninja 6.0
irlbot
iron33/1.0.2
israelisearch/1.0
iti spider
jennybot
jetcar
joc web spider
jubiirobot
jumpstation
katipo/1.0
kenjin spider
keyword density/0.9
labelgrab/1.1
larbin
leechftp
lexibot
libweb/clshttp
linguee bot
linkdexbot/2.1
linkedinbot
linkextractorpro
linklooker
linkscan/8.1a unix
linkwalker
lnspiderguy
looksmart
lwp-trivial
lwp-trivial/1.34
magpie-crawler
mass downloader/2.2
mata hari
mediafox/x.y
merzscope
metagopher
microsoft url control
microsoft url control - 5.01.4511
microsoft url control - 6.00.8169
midown tool
miixpc
miixpc/4.2
mister pix
mnogosearch
moget
moget/2.1
momspider/1.00 libwww-perl/0.40
motor/0.2
mozilla/4.0 (compatible; bullseye; windows 95)
msiecrawler
msr-isrccrawler
msrbot
naver
naverbot
navroad
nearsite
neosciocrawler
net vampire/3.0
netants
netcarta cyberpilot pro
netmechanic
netscoop/1.0 libwww/5.0a
netspider
netzip
nhsewalker/3.0
nicerspro
nomad-v2.x
northstar
npbot
nutch
obot
occam/1.0
octopus
offline explorer
ogspider
omniexplorer_bot
open text site crawler v1.0
openbot
openfind
openfind data gathere
oracle ultra search
pagegrabber
pagepeeker
papa foto
pavuk
pcbrowser
perman
pgp-ka/1.2
propowerbot/2.14
prowebwalker
psbot
python-urllib
queryn metasearch
r6_commentreader
r6_feedfetcher
radiation retriever 1.1
realdownload/4.0.0.42
reget
repomonkey
repomonkey bait & tackle/v1.01
resume robot
rma
roverbot
safetynet robot 0.1
scoutjet
searchpreview
senrigan/xxxxxx
seznambot
sitesnagger
slysearch
smartdownload
snooper/b97_01
sogou web spider
solbot/1.0 lwp/5.07
sootle
sosospider
spankbot
spanner
spanner/1.0 (linux 2.0.27 i586)
speedy spider
spyder3.microsys.com
superbot
superhttp/1.0
surfbot
surveybot
suzuran
szukacz/1.4
takeout
teleport
teleportpro
telesoft
the intraformant
thenomad
tighttwatbot
titan
tocrawl/urldispatcher
true_robot
true_robot/1.0
turingos
turnitinbot
twiceler
ucsd-crawler
unisterbot
unwindfetchor/1.0
url control
url_spider_pro
urlck/1.2.3
urly warning
valkyrie/1.0 libwww-perl/0.40
vbseo
vci
vci webviewer vci webviewer win32
voideye
voilabot
web image collector
web sucker
webauto
webbandit
webbandit/3.50
webcopier
webenhancer
webferret
webfetch
webleacher
webmasterworld extractor
webmasterworldforumbot
webreaper
websauger
website quester
webster pro
webstripper
webvac
webwalk
webwatch
webwhacker
webzip
webzip/4.0
wget
wget/1.5.3
wget/1.6
whowhere robot
widow
www-collector-e
xaldon webspider
yandex
zeus
zeus 32297 webster pro v2.9 win32
zeus link scout
zyborg

Rule Path
Disallow /

*

Rule Path
Disallow /?s=*
Disallow /?msite=*
Disallow /wp-admin/
Allow /wp/wp-admin/admin-ajax.php

Other Records

Field Value
sitemap https://www.unimarconi.it/sitemap_index.xml
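As a rough illustration of how a crawler might evaluate the groups listed above, the following sketch feeds an abbreviated reconstruction of the file to Python's standard `urllib.robotparser`. The raw file is not shown in this report, so the layout of `ROBOTS_TXT` is an assumption, only two of the blocked user agents are included, and `ExampleBot` and the `allowed` helper are hypothetical names. The standard-library parser matches path prefixes and does not expand `*` wildcards, so the two query-string rules are left out.

```python
from urllib.robotparser import RobotFileParser

# Abbreviated reconstruction (assumption: the report lists parsed groups,
# not the raw text). Only two of the blocked user agents are shown.
ROBOTS_TXT = """\
User-agent: ahrefsbot
User-agent: wget
Disallow: /

User-agent: *
Disallow: /wp-admin/
Allow: /wp/wp-admin/admin-ajax.php

Sitemap: https://www.unimarconi.it/sitemap_index.xml
"""

BASE = "https://www.unimarconi.it"

rp = RobotFileParser()
rp.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the rules above."""
    return rp.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    # A listed agent falls under the "Disallow /" group.
    print("wget /            ->", allowed("wget", "/"))
    # An unlisted (hypothetical) agent falls under the "*" group.
    print("ExampleBot /      ->", allowed("ExampleBot", "/"))
    print("ExampleBot admin  ->", allowed("ExampleBot", "/wp-admin/"))
    print("ExampleBot ajax   ->",
          allowed("ExampleBot", "/wp/wp-admin/admin-ajax.php"))
    print("sitemaps          ->", rp.site_maps())
```

Other parsers may resolve overlapping rules differently (for example by longest match rather than first match), so results for edge cases can vary between crawlers.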

Warnings

  • 4 invalid lines.