basilicasantambrogio.it
robots.txt

Robots Exclusion Standard data for basilicasantambrogio.it

Resource Scan

Scan Details

Site Domain basilicasantambrogio.it
Base Domain basilicasantambrogio.it
Scan Status Ok
Last Scan2024-05-27T02:28:32+00:00
Next Scan 2024-06-26T02:28:32+00:00

Last Scan

Scanned2024-05-27T02:28:32+00:00
URL https://basilicasantambrogio.it/robots.txt
Redirect https://www.basilicasantambrogio.it/robots.txt
Redirect Domain www.basilicasantambrogio.it
Redirect Base basilicasantambrogio.it
Domain IPs 89.46.105.64
Redirect IPs 89.46.105.64
Response IP 89.46.105.64
Found Yes
Hash 2edaf3f278e36f1cdbb2bd372216faa8f250d50eb82456bf3896a1353e515b6e
SimHash 93555382c3b2

Groups

*
yandex
yandexturbo
yandexbot
baiduspider
yisouspider
petalbot
bytespider
sogou web spider
sogou inst spider
linespider
barkrowler

Rule Path
Disallow /

baiduspider
baiduspider-video
baiduspider-image

Rule Path
Disallow /

sogou spider

Rule Path
Disallow /

youdaobot

Rule Path
Disallow /

ahrefsbot
blexbot
dotbot
screaming frog seo spider
mj12bot
semrushbot
semrushbot-ba
semrushbot-ct
semrushbot-bm
semrushbot-sa
semrushbot-swa
semrushbot-si
siteauditbot
splitsignalbot
proximic
rogerbot
yandexmetrika
xenu's
sistrix
seokicks
seokicks-robot
yisouspider
qwantify
uptimebot

Rule Path
Disallow /

zoombot

Rule Path
Disallow /

ruby
python-requests
libwww-perl
dotbot
mail.ru_bot
alexibot
aqua_products
asterias
b2w
backdoorbot
becomebot
bloglovin
blowfish
bookmark search tool
bot
botalot
builtbottough
bullseye
bunnyslippers
calculon spider
cheesebot
cherrypicker
cherrypickerelite
cherrypickerse
coccoc
copernic
copyrightcheck
cosmos
crescent
daum
dittospyder
dumbot
emailcollector
emailsiphon
emailwolf
enterprise_search
erocrawler
es
exabot
extractorpro
ezooms
fairad client
fatbot
flaming attackbot
foobot
freefind
gaisbot
getright
grub
grub-client
harvest
hatena antenna
hloader
httplib
humanlinks
idg
infonavirobot
inoreader.com
iron33
jennybot
jetbot
jetbot
jikespider
kenjin spider
keyword density
larbin
lexibot
libweb/clshttp
linkextractorpro
linkscan
linkwalker
lnspiderguy
lwp-trivial
mata hari
megaindex.ru
microsoft url control
miixpc
mister pix
moget
megaindex.ru
spbot
msiecrawler
naver
netants
netmechanic
nicerspro
nutch
offline explorer
omniexplorer_bot
openbot
openfind
openfind data gathere
optimizer
oracle ultra search
parser
paperlibot
pcore-http
perman
php
propowerbot
prowebwalker
psbot
pu_in crawler
python-urllib
pyton-requests
qwantify
queryn metasearch
radiation retriever
repomonkey
repomonkey bait & tackle
rma
searchpreview
seznambot
sitebot
sitesnagger
smtbot
sogou
sootle
sosospider
spankbot
spanner
spinn3r
stanford
stanford comp sci
suzuran
swiftbot
szukacz
teleport
teleportpro
telesoft
the intraformant
thenomad
tocrawl
trident
true_robot
turingos
updownerbot
url control
url_spider_pro
urly warning
vci
vci webviewer vci webviewer win32
voilabot
web image collector
webauto
webbandit
webcopier
webenhancer
websauger
website quester
webster pro
webstripper
webvac
webzip
wget
willybot
woorankreview
wordpress
www-collector-e
yodaobot
yak
zeus

Rule Path
Disallow /

moget
ichiro

Rule Path
Disallow /

naverbot
yeti

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.basilicasantambrogio.it/sitemap.xml

Comments

  • basilicasantambrogio.it robots.txt
  • SE
  • User-agent: Slurp
  • SEOTOOLS
  • https://it.semrush.com/bot/
  • SOCIAL
  • User-agent: Twitterbot
  • Disallow: /
  • OTHER

Warnings

  • 2 invalid lines.
  • `disallosw` is not a known field.