guidadiprocida.com
robots.txt

Robots Exclusion Standard data for guidadiprocida.com

Resource Scan

Scan Details

Site Domain guidadiprocida.com
Base Domain guidadiprocida.com
Scan Status Ok
Last Scan 2024-06-07T05:43:10+00:00
Next Scan 2024-06-14T05:43:10+00:00

Last Scan

Scanned 2024-06-07T05:43:10+00:00
URL https://guidadiprocida.com/robots.txt
Domain IPs 104.21.95.151, 172.67.145.132, 2606:4700:3033::6815:5f97, 2606:4700:3036::ac43:9184
Response IP 104.21.95.151
Found Yes
Hash 4f2369d23e18a293721f6709eda99c31fad79dfe87d2055a77b3c18acf6d5f03
SimHash d31c528ac3b7
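
The Hash value above is 64 hexadecimal characters long, which is consistent with a SHA-256 digest, although the report does not name the algorithm. The sketch below assumes SHA-256 over the raw response bytes; the function name `fingerprint` is hypothetical and is not part of the scanner.

```python
import hashlib


def fingerprint(body: bytes) -> str:
    """Return a hex digest of a robots.txt response body.

    Assumption: the scanner hashes the raw bytes with SHA-256.
    The report only shows a 64-character hex string, so this is a guess.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_digest: str, body: bytes) -> bool:
    """Compare a stored digest with the digest of a freshly fetched body."""
    return fingerprint(body) != previous_digest
```

Storing the digest from one scan and comparing it with the next is a cheap way to detect that the file changed between the Last Scan and Next Scan times.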

Groups

*

Rule Path
Disallow /writer/
Disallow /lib/
Disallow /core/modules/
Disallow /core/views/
Disallow /ai1wm-backups/
Disallow /core/ai1wm-backups/
Disallow /readme.html
Disallow /license.txt
Disallow /wp-config.php
Disallow /install.php
Disallow /*.log$
Disallow /*.tmp$
Disallow /*.bak$
Disallow /*.swp$
Disallow /?s=*
Disallow /search/
Disallow /*?p=*&preview=true
Disallow /*?page_id=*&preview=true
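
Several of the paths above use the `*` wildcard and the `$` end anchor. As a minimal sketch of how such patterns are commonly interpreted (`*` matches any run of characters, a trailing `$` pins the match to the end of the path), the patterns can be translated to regular expressions. The helper names are hypothetical, only a subset of the listed rules is used, and Allow rules, longest-match precedence, and percent-encoding are left out.

```python
import re
from typing import Iterable

# A subset of the Disallow paths listed for the "*" group.
SAMPLE_DISALLOW = [
    "/writer/",
    "/*.log$",
    "/?s=*",
    "/search/",
    "/*?p=*&preview=true",
]


def pattern_to_regex(pattern: str) -> "re.Pattern[str]":
    """Translate one robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape the literal pieces and join them with ".*" for each "*".
    body = ".*".join(re.escape(piece) for piece in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_disallowed(path: str, patterns: Iterable[str]) -> bool:
    """Return True if any pattern matches the start of the path."""
    return any(pattern_to_regex(p).match(path) for p in patterns)
```

Under this reading, `/debug.log` is blocked by `/*.log$` while `/debug.log.txt` is not, because the `$` requires the path to end in `.log`.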

yandex
yandexturbo
yandexbot
baiduspider
baiduspider-video
baiduspider-image
sogou spider
youdaobot
linespider
barkrowler
spidersogou
exabot
swiftbot
ccbot
slurp
naverbot
yeti
moget
ichiro

Rule Path
Disallow /

blexbot
dotbot
proximic
yandexmetrika
screaming frog seo spider
xenu's
sistrix
seokicks
seokicks-robot
yisouspider
qwantify
uptimebot
lumar
cognitiveseo
oncrawl
ahrefsbot
ezooms
exabot
mj12bot
ccbot
meanpathbot
searchmetricsbot
slurp/2.0
sogou

Rule Path
Disallow /

ruby
python-requests
libwww-perl
dotbot
mail.ru_bot
alexibot
aqua_products
asterias
b2w
backdoorbot
becomebot
bloglovin
blowfish
bookmark search tool
bot
botalot
builtbottough
bullseye
bunnyslippers
calculon spider
cheesebot
cherrypicker
cherrypickerelite
cherrypickerse
coccoc
copernic
copyrightcheck
cosmos
crescent
daum
dittospyder
dumbot
emailcollector
emailsiphon
emailwolf
enterprise_search
erocrawler
es
exabot
extractorpro
ezooms
fairad client
fatbot
flaming attackbot
foobot
freefind
gaisbot
getright
grub
grub-client
harvest
hatena antenna
hloader
httplib
humanlinks
idg
infonavirobot
inoreader.com
iron33
jennybot
jetbot
jetbot
jikespider
kenjin spider
keyword density
larbin
lexibot
libweb/clshttp
linkextractorpro
linkscan
linkwalker
lnspiderguy
lwp-trivial
mata hari
megaindex.ru
microsoft url control
miixpc
mister pix
moget
megaindex.ru
spbot
msiecrawler
naver
netants
netmechanic
nicerspro
nutch
offline explorer
omniexplorer_bot
openbot
openfind
openfind data gathere
optimizer
oracle ultra search
parser
paperlibot
pcore-http
perman
php
propowerbot
prowebwalker
psbot
pu_in crawler
python-urllib
pyton-requests
qwantify
queryn metasearch
radiation retriever
repomonkey
repomonkey bait & tackle
rma
searchpreview
seznambot
sitebot
sitesnagger
smtbot
sogou
sootle
sosospider
spankbot
spanner
spinn3r
stanford
stanford comp sci
suzuran
swiftbot
szukacz
teleport
teleportpro
telesoft
the intraformant
thenomad
tocrawl
trident
true_robot
turingos
updownerbot
url control
url_spider_pro
urly warning
vci
vci webviewer vci webviewer win32
voilabot
web image collector
webauto
webbandit
webcopier
webenhancer
websauger
website quester
webster pro
webstripper
webvac
webzip
wget
willybot
www-collector-e
yodaobot
yak
zeus

Rule Path
Disallow /
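
To illustrate how the groups above apply to a given crawler, here is a short sketch using Python's standard `urllib.robotparser`. The robots.txt text in it is a trimmed stand-in assembled from a few of the entries listed above, not the site's actual file, and `SomeCrawler` is a made-up user agent.

```python
from urllib.robotparser import RobotFileParser

# Trimmed stand-in: two paths from the "*" group and two agents
# from one of the "Disallow /" groups reported above.
SAMPLE_ROBOTS = """\
User-agent: *
Disallow: /writer/
Disallow: /search/

User-agent: ahrefsbot
User-agent: mj12bot
Disallow: /
"""


def build_parser(text: str) -> RobotFileParser:
    """Parse robots.txt text without making a network request."""
    parser = RobotFileParser()
    parser.parse(text.splitlines())
    return parser


parser = build_parser(SAMPLE_ROBOTS)
base = "https://guidadiprocida.com"

# A named agent falls under its own group and is blocked everywhere.
blocked_everywhere = parser.can_fetch("AhrefsBot", base + "/")

# An unlisted agent falls back to the "*" group.
home_allowed = parser.can_fetch("SomeCrawler", base + "/")
search_allowed = parser.can_fetch("SomeCrawler", base + "/search/x")
```

The standard-library parser does not interpret `*` or `$` inside paths, and it appears to match user-agent names by substring, so a very short entry such as `bot` in the full list could match many crawlers under this parser. Other parsers may behave differently.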

Other Records

Field Value
sitemap https://guidadiprocida.com/sitemap_index.xml

Comments

  • SE
  • SEOTOOLS
  • OTHER

Warnings

  • 1 invalid line.