lascatoladeisegreti.it
robots.txt

Robots Exclusion Standard data for lascatoladeisegreti.it

Resource Scan

Scan Details

Site Domain lascatoladeisegreti.it
Base Domain lascatoladeisegreti.it
Scan Status Ok
Last Scan 2024-10-05T17:32:30+00:00
Next Scan 2024-10-12T17:32:30+00:00

Last Scan

Scanned 2024-10-05T17:32:30+00:00
URL https://lascatoladeisegreti.it/robots.txt
Domain IPs 104.21.15.136, 172.67.162.169
Response IP 172.67.162.169
Found Yes
Hash a65dbb59d51b8b1cb966f7fcecfb095d6002160e83224019e9d4cd309e9e2f0d
SimHash 93693353886a

Groups

mj12bot
zoombot
ahrefsbot
aihitbot
alphaseobot
alphaseobot-sa
backlinkcrawler
baiduspider
baiduspider-image
baiduspider-video
blackwidow
blexbot
bubing
careerbot
ccbot
chinaclaw
cliqzbot
cloudservermarketspider
coccoc
custo
disco
domain re-animator bot
dotbot
dotbot
download\ demon
ecatch
eccp/1.0 (search@eniro.com)
eirgrabber
emailsiphon
emailwolf
exabot
express\ webpictures
extractorpro
eyenetie
ezooms robot
flashget
fr-crawler
getright
getweb!
gigabot
go!zilla
go-ahead-got-it
grabnet
grafula
grapeshot
grapeshotcrawler/2.0
gsa-crawler (enterprise; t4-knhh62cdkc2w3; gsa_manage@nikon-sys.co.jp)
gsa-crawler
haosouspider
hmview
httrack
icc-crawler/2.0
iccrawler
image\ stripper
image\ sucker
implisensebot
indy\ library
interget
internet\ ninja
jetcar
jobboersebot
jobs.de-robot
joc\ web\ spider
kraken
larbin
leechftp
linkdexbot/2.1
linkstats
lipperhey-kaus-australis
magpie-crawler
mass\ downloader
meanpathbot
megaindex.com
megaindex.ru
megaindex.ru
megaindex.ru/2.0
metajobbot
midown\ tool
mindupbot
mister\ pix
navroad
nearsite
nerdybot
net\ vampire
netants
netestate ne crawler
netestate ne crawler (+http://www.website-datenbank.de/)
netspider
netzip
obot
octopus
offline\ explorer
offline\ navigator
openhosebot
pagegrabber
papa\ foto
pavuk
pcbrowser
perl lwp
pi-monster
pimonster
pimonster
plista
plukkie
proximic
qwantify
r6_commentreader
realdownload
reget
rogerbot
rogerbot
safednsbot
screaming frog seo spider
searchmetricsbot
semrushbot
seodiver
seokicks-robot
seoscanners.net
seznambot
superbot
superhttp
surfbot
surveybot
takeout
teleport\ pro
thumbsniper
sg-orbiter
sistrix
sistrix
sistrix crawler
sitesnagger
smartdownload
sogou spider
sogou spider
spbot
spiderbot
trendictionbot
webzip
wget
widow
wiseguys robot
wotbox
wwwoffle
xaldon\ webspider
yadirectfetcher
yandex
yandexmobilebot
youdaobot
zeus
turnitin robot
turnitinbot
um-ic
unisterbot
uptimerobot/2.0
voideye
wbsearchbot
web\ image\ collector
web\ sucker
webauto
webcopier
webfetch
webgo\ is
webleacher
webreaper
websauger
website\ extractor
website\ quester
webstripper
webwhacker

Rule Path
Disallow /

*

Rule Path
Disallow /privacy-policy
Disallow /cookie-policy

Other Records

Field Value
sitemap https://www.lascatoladeisegreti.it/sitemap_index.xml
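To illustrate how the groups, rules, and sitemap record above are interpreted, here is a minimal sketch using Python's standard urllib.robotparser (site_maps() needs Python 3.8 or later). The robots.txt text in it is an abridged reconstruction, not the site's verbatim file: it assumes the listed agents share a single "Disallow: /" group, keeps only three of them, and uses "ExampleBot" and "/some-page" as hypothetical names. Other parsers may match user agents differently.

```python
from urllib.robotparser import RobotFileParser

# Abridged reconstruction (an assumption, not the verbatim file):
# three of the listed agents in one "Disallow: /" group, plus the
# wildcard group and the sitemap record reported by the scan.
ROBOTS_TXT = """\
User-agent: MJ12bot
User-agent: AhrefsBot
User-agent: SemrushBot
Disallow: /

User-agent: *
Disallow: /privacy-policy
Disallow: /cookie-policy

Sitemap: https://www.lascatoladeisegreti.it/sitemap_index.xml
"""

BASE = "https://lascatoladeisegreti.it"

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())


def allowed(agent: str, path: str) -> bool:
    """Return True if `agent` may fetch `path` under the rules above."""
    return parser.can_fetch(agent, BASE + path)


if __name__ == "__main__":
    # "ExampleBot" and "/some-page" are hypothetical, for illustration only.
    for agent, path in [
        ("AhrefsBot", "/"),
        ("ExampleBot", "/some-page"),
        ("ExampleBot", "/privacy-policy"),
        ("ExampleBot", "/cookie-policy"),
    ]:
        print(agent, path, "allowed" if allowed(agent, path) else "blocked")
    print("Sitemaps:", parser.site_maps())
```

Under this reconstruction, any agent named in the first group is blocked from the whole site, while every other crawler falls through to the wildcard group and is kept out of the two policy pages only.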

Comments

  • restore User-agent: ia_archiver

Warnings

  • 2 invalid lines.