cammini.net
robots.txt

Robots Exclusion Standard data for cammini.net

Resource Scan

Scan Details

Site Domain cammini.net
Base Domain cammini.net
Scan Status Ok
Last Scan2024-11-13T11:18:44+00:00
Next Scan 2024-11-20T11:18:44+00:00

Last Scan

Scanned2024-11-13T11:18:44+00:00
URL https://cammini.net/robots.txt
Domain IPs 34.120.190.48, 35.190.31.54, 35.227.194.51, 35.244.153.44
Response IP 34.120.190.48
Found Yes
Hash f5f70d9e8738ff4d0d051ef2d0fa8a36f29fcde1a5a873ab2b87d9ca57bfd5d8
SimHash d35552aa47b6

Groups

*

Rule Path
Disallow /author/
Disallow /search/
Disallow /plugins/
Disallow /cgi-bin/
Disallow /wp-admin/
Disallow */?gcb=
Disallow *?gcb=*
Disallow */?fb_comment_id=
Disallow *?cb=*
Disallow */?utm_source=
Disallow */?utm_content=

Other Records

Field Value
crawl-delay 10

ahrefsbot
blexbot
dotbot
mj12bot
proximic
rogerbot
yandexmetrika
xenu's
sistrix
seokicks
seokicks-robot
yisouspider
qwantify
uptimebot

Rule Path
Disallow /

yandex
yandexturbo
yandexbot
baiduspider
linespider
barkrowler

Rule Path
Disallow /

ruby
python-requests
libwww-perl
dotbot
mail.ru_bot
alexibot
aqua_products
asterias
b2w
backdoorbot
becomebot
bloglovin
blowfish
bookmark search tool
bot
botalot
builtbottough
bullseye
bunnyslippers
calculon spider
cheesebot
cherrypicker
cherrypickerelite
cherrypickerse
coccoc
copernic
copyrightcheck
cosmos
crescent
daum
dittospyder
dumbot
emailcollector
emailsiphon
emailwolf
enterprise_search
erocrawler
es
exabot
extractorpro
ezooms
fairad client
fatbot
flaming attackbot
foobot
freefind
gaisbot
getright
grub
grub-client
harvest
hatena antenna
hloader
httplib
humanlinks
idg
infonavirobot
inoreader.com
iron33
jennybot
jetbot
jetbot
jikespider
kenjin spider
keyword density
larbin
lexibot
libweb/clshttp
linkextractorpro
linkscan
linkwalker
lnspiderguy
lwp-trivial
mata hari
megaindex.ru
microsoft url control
miixpc
mister pix
moget
megaindex.ru
spbot
msiecrawler
naver
netants
netmechanic
nicerspro
nutch
offline explorer
omniexplorer_bot
openbot
openfind
openfind data gathere
optimizer
oracle ultra search
parser
paperlibot
pcore-http
perman
php
propowerbot
prowebwalker
psbot
pu_in crawler
python-urllib
pyton-requests
qwantify
queryn metasearch
radiation retriever
repomonkey
repomonkey bait & tackle
rma
searchpreview
seznambot
sitebot
sitesnagger
smtbot
sogou
sootle
sosospider
spankbot
spanner
spinn3r
stanford
stanford comp sci
suzuran
swiftbot
szukacz
teleport
teleportpro
telesoft
the intraformant
thenomad
tocrawl
trident
true_robot
turingos
updownerbot
url control
url_spider_pro
urly warning
vci
vci webviewer vci webviewer win32
voilabot
web image collector
webauto
webbandit
webcopier
webenhancer
websauger
website quester
webster pro
webstripper
webvac
webzip
wget
willybot
wordpress
www-collector-e
yodaobot
yak
zeus

Rule Path
Disallow /

yandex
yandexturbo
yandexbot
baiduspider
linespider
barkrowler

Rule Path
Disallow /

Other Records

Field Value
sitemap https://cammini.net/sitemap_index.xml

Comments

  • SEOTOOLS
  • SE
  • User-agent: Slurp
  • 202010
  • SOCIAL
  • User-agent: Twitterbot
  • Disallow: /
  • OTHER
  • SE
  • User-agent: Slurp
  • 202010

Warnings

  • 1 invalid line.