cardinalsiri.it
robots.txt

Robots Exclusion Standard data for cardinalsiri.it

Resource Scan

Scan Details

Site Domain cardinalsiri.it
Base Domain cardinalsiri.it
Scan Status Ok
Last Scan2024-09-30T18:51:39+00:00
Next Scan 2024-10-30T18:51:39+00:00

Last Scan

Scanned2024-09-30T18:51:39+00:00
URL https://cardinalsiri.it/robots.txt
Domain IPs 104.21.68.56, 172.67.187.190
Response IP 172.67.187.190
Found Yes
Hash c0deb865ea1e14cda0177b8f41976cc54f1784e0bdf14026a13038f4ea3126ff
SimHash 93693353884b

Groups

*

Rule Path
Allow /
Disallow /cookie-policy
Disallow /privacy-policy

mj12bot
zoombot
ahrefsbot
aihitbot
alphaseobot
alphaseobot-sa
backlinkcrawler
baiduspider
baiduspider-image
baiduspider-video
blackwidow
blexbot
bubing
careerbot
ccbot
chinaclaw
cliqzbot
cloudservermarketspider
coccoc
custo
disco
domain re-animator bot
dotbot
dotbot
download\ demon
ecatch
eccp/1.0 (search@eniro.com)
eirgrabber
emailsiphon
emailwolf
exabot
express\ webpictures
extractorpro
eyenetie
ezooms robot
flashget
fr-crawler
getright
getweb!
gigabot
go!zilla
go-ahead-got-it
grabnet
grafula
grapeshot
grapeshotcrawler/2.0
gsa-crawler (enterprise; t4-knhh62cdkc2w3; gsa_manage@nikon-sys.co.jp)
gsa-crawler
haosouspider
hmview
httrack
ia_archiver
icc-crawler/2.0
iccrawler
image\ stripper
image\ sucker
implisensebot
indy\ library
interget
internet\ ninja
jetcar
jobboersebot
jobs.de-robot
joc\ web\ spider
kraken
larbin
leechftp
linkdexbot/2.1
linkstats
lipperhey-kaus-australis
magpie-crawler
mass\ downloader
meanpathbot
megaindex.com
megaindex.ru
megaindex.ru
megaindex.ru/2.0
metajobbot
midown\ tool
mindupbot
mister\ pix
navroad
nearsite
nerdybot
net\ vampire
netants
netestate ne crawler
netestate ne crawler (+http://www.website-datenbank.de/)
netspider
netzip
obot
octopus
offline\ explorer
offline\ navigator
openhosebot
pagegrabber
papa\ foto
pavuk
pcbrowser
perl lwp
pi-monster
pimonster
pimonster
plista
plukkie
proximic
qwantify
r6_commentreader
realdownload
reget
rogerbot
rogerbot
safednsbot
screaming frog seo spider
searchmetricsbot
semrushbot
seodiver
seokicks-robot
seoscanners.net
seznambot
sg-orbiter
sistrix
sistrix
sistrix crawler
sitesnagger
smartdownload
sogou spider
sogou spider
spbot
spiderbot
superbot
superhttp
surfbot
surveybot
takeout
teleport\ pro
thumbsniper
trendictionbot
turnitin robot
turnitinbot
um-ic
unisterbot
uptimerobot/2.0
voideye
wbsearchbot
web\ image\ collector
web\ sucker
webauto
webcopier
webfetch
webgo\ is
webleacher
webreaper
websauger
website\ extractor
website\ quester
webstripper
webwhacker
webzip
wget
widow
wiseguys robot
wotbox
wwwoffle
xaldon\ webspider
yadirectfetcher
yandex
yandexmobilebot
youdaobot
zeus

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

Other Records

Field Value
sitemap http://www.cardinalsiri.it/sitemap.xml

Warnings

  • 2 invalid lines.