ipnews.ru
robots.txt

Robots Exclusion Standard data for ipnews.ru

Resource Scan

Scan Details

Site Domain ipnews.ru
Base Domain ipnews.ru
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Couldn't connect to server.
Last Scan 2024-09-28T02:45:56+00:00
Next Scan 2024-12-27T02:45:56+00:00

Last Successful Scan

Scanned 2022-12-04T23:11:24+00:00
URL https://ipnews.ru/robots.txt
Domain IPs 159.69.68.209
Response IP 159.69.68.209
Found Yes
Hash 803a4969367fb7e814a8c8a235b358f4c060684820d0b77b487ebd245303c3b2
SimHash c142e6f12ec2
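
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest. The report does not name the algorithm, so SHA-256 over the raw response body is an assumption here, and the helper names `body_hash` and `is_unchanged` are hypothetical. A minimal sketch of how a later fetch could be compared against the recorded value:

```python
import hashlib

# Hash recorded in the last successful scan (copied from the report above).
RECORDED_HASH = "803a4969367fb7e814a8c8a235b358f4c060684820d0b77b487ebd245303c3b2"


def body_hash(body: bytes) -> str:
    """Hex digest of the raw robots.txt bytes (assumed algorithm: SHA-256)."""
    return hashlib.sha256(body).hexdigest()


def is_unchanged(body: bytes, recorded: str = RECORDED_HASH) -> bool:
    """True if a freshly fetched body hashes to the recorded value."""
    return body_hash(body) == recorded.lower()
```

If the scanner normalizes the body before hashing (line endings, encoding), the digests would differ even for an unchanged file, so a mismatch alone does not prove the file changed.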

Groups

*

Rule Path
Disallow /a/
Disallow /adv/
Disallow /cgi-bin
Disallow /wp-admin
Disallow /wp-includes
Disallow /wp-content/plugins
Disallow /wp-content/cache
Disallow /wp-trackback
Disallow /wp-feed
Disallow /wp-comments
Disallow */trackback
Disallow /xmlrpc.php
Disallow */feed
Disallow */comments
Disallow */?comments
Disallow */?attachment
Disallow /wp-cron.php
Disallow /author/admin/
Disallow /author/
Disallow /cgi-sys
Disallow /date*
Disallow /date/
Disallow /page/
Disallow /letter*
Disallow /letter/
Disallow /funkit/
Disallow /zakaz/
Disallow /people/
Disallow /mobile/
Disallow /m/
Disallow /game/
Disallow /wiki/
Disallow /game/
Disallow */gal/
Disallow */adm/
Disallow */galleryv/
Disallow */galleryp/
Disallow */yege/
Disallow */nahucheba/
Disallow /user/transport
Disallow /user/transportc
Disallow /user/message
Disallow /galerei
Disallow /foto/
Disallow /text2/
Disallow *ftp.pandia.ru
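
The rules in this group mix plain path prefixes (`/wp-admin`) with `*` wildcards (`*/feed`, `/date*`). The sketch below shows one common way such patterns are interpreted, assuming the matching rules of RFC 9309: a pattern matches from the start of the path, `*` stands for any sequence of characters, and a trailing `$` anchors the end. The function name `rule_matches` is hypothetical and is not taken from the scanner.

```python
import re


def rule_matches(pattern: str, path: str) -> bool:
    """Return True if a robots.txt path pattern matches the URL path."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then join the pieces with '.*' for every '*'.
    regex = ".*".join(re.escape(part) for part in pattern.split("*"))
    if anchored:
        regex += "$"
    # re.match anchors at the start of the path, which gives prefix semantics.
    return re.match(regex, path) is not None


print(rule_matches("/wp-admin", "/wp-admin/options.php"))  # prefix rule
print(rule_matches("*/feed", "/2022/12/post/feed"))        # leading wildcard
print(rule_matches("/date*", "/date/2022/"))               # trailing wildcard
print(rule_matches("/author/", "/authors"))                # no match expected
```

Under these semantics a trailing `*` adds nothing to a prefix rule, so `/date*` already covers everything that `/date/` does.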

yandex

Rule Path
Disallow /a/
Disallow /adv/
Disallow /cgi-bin
Disallow /wp-admin
Disallow /wp-includes
Disallow /wp-content/plugins
Disallow /wp-content/cache
Disallow /wp-trackback
Disallow /wp-feed
Disallow /wp-comments
Disallow */trackback
Disallow /xmlrpc.php
Disallow */feed
Disallow */comments
Disallow */?comments
Disallow */?attachment
Disallow /wp-cron.php
Disallow /author/admin/
Disallow /author/
Disallow /cgi-sys
Disallow /date*
Disallow /date/
Disallow /en/
Disallow /page/
Disallow /letter*
Disallow /letter/
Disallow /funkit/
Disallow /zakaz/
Disallow /people/
Disallow /mobile/
Disallow /m/
Disallow /game/
Disallow /wiki/
Disallow /game/
Disallow */gal/
Disallow */adm/
Disallow */galleryv/
Disallow */galleryp/
Disallow */yege/
Disallow */nahucheba/
Disallow /user/transport
Disallow /user/transportc
Disallow /user/message
Disallow /galerei
Disallow /foto/
Disallow *ftp.pandia.ru

googlebot

Rule Path
Disallow /a/
Disallow /adv/
Disallow /cgi-bin
Disallow /cgi-bin
Disallow /wp-admin
Disallow /wp-includes
Disallow /wp-content/plugins
Disallow /wp-content/cache
Disallow /wp-trackback
Disallow /wp-feed
Disallow /wp-comments
Disallow */trackback
Disallow /xmlrpc.php
Disallow */feed
Disallow */comments
Disallow */?comments
Disallow */?attachment
Disallow /wp-cron.php
Disallow /author/admin/
Disallow /author/
Disallow /cgi-sys
Disallow /date*
Disallow /date/
Disallow /page/
Disallow /letter*
Disallow /letter/
Disallow /funkit/
Disallow /zakaz/
Disallow /people/
Disallow /mobile/
Disallow /m/
Disallow /sitemap/
Disallow */fullist*
Disallow /game/
Disallow /wiki/
Disallow */gal/
Disallow */adm/
Disallow */galleryv/
Disallow */galleryp/
Disallow */yege/
Disallow */nahucheba/
Disallow /user/transport
Disallow /user/transportc
Disallow /user/message
Disallow /galerei
Disallow /foto/
Disallow *ftp.pandia.ru

a1 sitemap generator
a1 website download
a6-indexer
aasp
abachobot
abonti
abotemailsearch
aboundex
aboutusbot
accmonitor compliance server
accoon
achulkov.net page walker
acme.spider
acoonbot
acquia-crawler
activetouristbot
acunetix
ad muncher
adamm bot
adauth
adbeat_bot
adbot
adcentriaim
admantx
admantx-euaspb
adminshop.com
adscanner
adstxtcrawler
adstxtlab.com crawler
advanced email extractor
aesop_com_spiderman
aespider
af knowledge now verity spider
afd-verbotsverfahren
aggregator:vocus
agusescan
ahc
ah-ha.com crawler
ahrefs
ahrefsbot
aibot
aidu
aihitbot
aipbot
aisiid
aitcsrobot/1.1
ajsitemap
akamai-sitesnapshot
alexawebsearchplatform
alexfdownload
alexibot
alkalinebot
all acronyms bot
alligator
allsubmitter
alohabrowser
alpha search agent
alphabot
amerla search bot
amfibibot
amp-cloud.de
ampmppc.com
amznkassocbot
anarchie
anemone
anglesharp
anonymous
anonymous coward
anotherbot
answerbot
answerbus
answerchase prove
ant.com
antbot
antibot
antisantyworm
antro.net
anyconnect
aonde-spider
apexoo
aport
appengine-google
appid: s~stremor-crawler-
aqua_products
arabot
arachmo
arachnophilia
argclrint
aria equalizer
arianna.libero.it
arikus_spider
arquivo-web-crawler
artavisbot
artera
art-online.com
asaha search engine turkey
ask
aspider
aspseek
asterias
astrofind
athenusbot
atlocalbot
atomic_email_hunter
attach
attrakt
attributor
augurfind
auresys
auskunftbot
autobaron crawler
autoemailspider
autowebdir
avsearch-
axfeedsbot
axios
axonize-bot
ayna
b2w
babya discoverer
backdoorbot
backlink-ceck
backlink-check
backlinkcrawler
backrub
backstreet
backstreet browser
backweb
badass
baiduspider
baiduspider-image
baiduspider-video
bandit
barkrowler
batchftp
battleztar bazinga
baypup
bbbike
bdcbot
bdfetch
becomebot
becomejpbot
beetlebot
bender
besserscheitern-crawl
betabot
bidswitchbot
big brother
big data
bigado.com
bigcliquebot
bigfoot
biglotron
bilbo
bilgibetabot
bilgibot
bintellibot
bitacle
bitlybot
bitvouseragent
bizbot
bizworks retriever
black hole
black.hole
blackbird
blackboard
blackmask.net search engine
blackwidow
bladder fusion
blaiz-bee
blexbot
blinkx
blitzbot
blog conversation project
blogmyway
blogpulselive
blogrefsbot
blogscope
blogslive
bloobybot
blow
blowfish
blt
bnf.fr_bot
boaconstrictor
boardreader
boi_crawl_00
boia.org
boia-scan-agent
boitho
bolt
bookmark buddy bookmark checker
bookmark search tool
bosug
bot apoena
botalot
botrighthere
botswana
bottybot
bpbot
bpimagewalker
braintime_search
brandonbot
brandprotect
brandwatch
brokenlinkcheck.com
browseremulator
browsermob
bruinbot
bsearchr&d
bspider
btbot
btsearch
bubing
buddy
buibui
buildcms crawler
builtbottough
builtwith
bullseye
bumblebee
bunnyslippers
buscadorclarin
buscaplus robi
butterfly
buyhawaiibot
buzzbot
buzzsumo
byindia
byspider
byteserver
bzbot
c r a w l 3 r
cacheblaster
caddbot
cafi
cakephp
calculon
camcrawler
camelstampede
camscanner
canon-webrecord
careerbot
cataguru
catchbot
catexplorador
cazoodle
cazoodlebot
ccbot
ccgcrawl
ccubee
cd-preload
cegbfeieh
ce-preload
ceracon
cerberian drtrs
cert figleafbot
cfetch
cfnetwork
chameleon
charlotte
check&get
checkbot
checklinks
checkmarknetwork
cheesebot
chemiede-nodebot
cherrypicker
chilkat
chinaclaw
chlooe
chromeframe
cipacrawler
cipinetbot
cis455crawler
citeseerxbot
cizilla
clariabot
claritybot
claritydailybot
climate ark
climateark spider
cliqzbot
cloud mapping
cloudflare-alwaysonline
cloudflare-amp
clshttp
clushbot
coast scan engine
coast webmaster pro
coccoc
coccocbot-image
coccocbot-web
cogentbot
cognitiveseo
collapsarweb
collector
colocrossing
com.plumanalytics
combine
communigatepro
comodo ssl checker
companybook-crawler
connectsearch
conpilot
contacts-crawler
contentsmartz
contextad bot
contxbot
contype
cookienet
coolbot
coolcheck
copernic
copier
copyrightcheck
copyscape
core-project
cors bot
cosmos
covario-ids
cowbot-
cowdog bot
crabbybot
craftbot
crawl.sogou.com
crawl_application
crawler.feedback
crawler.kpricorn.org
crawler@
crawler_for_infomine
crawler43.ejupiter.com
crawler4j
crawly
crazywebcrawler
creativecommons
crescent
cs-crawler
cse html validator
cshttp
cshttpclient
cuasarbot
culsearch
curb
curious
curious george
custo
cvaulev
cyberdog
cybernavi_webget
cyberpatrol sitecat webbot
cyberspyder
cydralspider
d1garabicengine
databasedrivermysqli
datacha0s
datafountains
datanyze
dataparksearch
dataprovider.com
datascape robot
dataspearspiderbot
dataspider
dattatec.com
daum
daumoa
davecrawler
dblbot
dcpbot
dcrawl
deadlinkchecker
declumbot
deepindex
deepnet crawler
deeptrawl
dejan
del.icio.us-thumbnails
deltascan
delvubot
demandbase-bot
demon
der große bildersauger
deusu
devil
df bot
dfs-fetch
diagem
diamond
dibot
didaxusbot
digext
digger
digincore
digi-rssbot
digitalarchivesbot
digitalpebble
digout4u
diibot
dillo
dir_snatch.exe
dirbuster
disco
discobot
discordbot
discoverybot
dispatch
distilled-reputation-monitor
dittospyder
djangotraineebot
dkimrepbot
dmoz downloader
dnyzbot
docomo
dof-verify
domainappender
domaincrawler
domainscan
domainsigmacrawler
domainstatsbot
domainwatcher bot
dotbot
dotspotsbot
dow jones searchbot
download
download demon
download devil
download wonder
doy
dragonbot
dragonfly
drip
drone
dtaagent
dts agent
dtsearchspider
dumbot
dwaar
dxseeker
eah
earth platform indexer
earth science educator robot
easydl
ebingbong
ec2linkfinder
ecairn-grabber
ecatch
eccp/1.0
echoosebot
ecxi
edisterbot
edugovsearch
egothor
eidetica.com
eirgrabber
elblindo the blind bot
elisabot
ellerdalebot
email collector
email extractor
email siphon
email wolf
emailcollector
emailleach
emailsiphon
emailwolf
embedly
emeraldshield
empas_robot
enabot
endeca
enigmabot
enolyst crawler
enswer neuro bot
enter user-agent
entitycubebot
eright
erocrawler
e-societyrobot
estylesearch
esyndicat bot
eurosoft-bot
evaal
evc-batch
eventware
everest-vulcan inc.
evil
evolution
exabot
exactsearch
exactseek
exooba
exploder
expmag
express webpictures
extlinksbot
extractor
extractorpro
extreme picture finder
eyenetie
ezooms
ez-robot
factbot
fairad client
falcon
faraday
fast data search document retriever
fast esp
fastbot crawler
fastbot.de crawler
fast-search-engine
fatbot
favcollector
faviconizer
favorg
favorites sweeper
f-bot test pilot
fdm
fdse robot
fedcontractorbot
fembot
femtosearchbot
fetch api request
fetch_ici
fetchbot
fgcrawler
fhscan
fiddlesticks
filangy
filehound
fimap
finbot
findanisp.com_isp_finder
findlinks
findweb
findxbot
firebat
firefox/7.0
firstgov.gov search
flaming attackbot
flamingo_searchengine
flashcapture
flashget
flickysearchbot
flipboardproxy
fluffy the spider
flunky
focused_crawler
focuseekbot
followsite
foobot
fooooo_web_video_crawl
fopper
forcepointcrawler
formulafinderbot
forschungsportal
fq
fr_crawler
francis
freeuploader
freewebmonitoring sitechecker
freshcrawler
freshdownload
freshlinks.exe
friendfeedbot
frodo.at
froggle
frontpage
froola bot
full_breadth_crawler
fu-nbi
funnelback
furlbot
fyberspider
fyrebot
g10-bot
gaisbot
galaxy
galaxybot
garlikcrawler
gazz
gbplugin
gdpr bot
generate_infomine_category_classifiers
genevabot
geniebot
genieo
geomaxenginebot
geometabot
geonabot
geovisu
germcrawler
gethtmlcontents
getintent
getintent crawler
getleft
getright
getsmart
geturl.rexx
getweb
giant
gigablast
gigablastopensource
gigabot
g-i-g-a-b-o-t
girafabot
gleamebot
gluten free crawler
gmscore
gnome-vfs
go!zilla
go-ahead-got-it
godzilla
goforit.com
goforitbot
go-http-client
gold crawler
goldfire server
golem
goodjelly
gordon-college-google-mini
goroam
goseebot
gotit
govbot
gozilla
gpu p2p crawler
grabber
grabnet
grafula
grapefx
grapeshot
grapeshotcrawler
grbot
greenyogi
gridbot
grobbot
gromit
grouphigh
grub
gslfbot
gt::www
gulliver
gulperbot
gurujibot
gvc business crawler
gvc crawler
gvc search bot
gvc web crawler
gvc weblink crawler
gvc world links
gvcbot.com
haansoft
hackney
haosouspider
happyfunbot
harvest
hatena antenna
havij
hawler
hcat
hclsreport-crawler
hd nutch agent
header_test_client
headmasterseo
healia
helix
here will be link to crawler site
heritrix
hiscan
hisoftware accmonitor server
hisoftware accverify
hitcrawler
hivabot
hloader
hmsebot
hmview
hoge
holmes
homepagesearch
honeybee
hooblybot-image
hoowwwer
hostcrawler
hsft - link scanner
hsft - lvu scanner
hslide
ht://check
htdig
html link validator
htmlparser
http::lite
httplib
httrack
huaweisymantecspider
hubspot webcrawler
hul-wax
humanlinks
hybridbot
hyperestraier
hyperix
hyscore
ia_archiver
iabtechlab ads.txt crawler
ias crawler
iblog
ibuena
icab
icds-ingestion
ichiro
icopyright conductor
idbot
idg/uk
idmarch automatic
id-search
ieautodiscovery
iecheck
iframely
ihwebchecker
iiitbot
iim_405
ilsebot
iltrovatore
image fetch
image stripper
image sucker
imagebot
image-fetcher
imagefortress
imageshereimagesthereimageseverywhere
imagevisu
imds_monitor
imo-google-robot-intelink
implisensebot
inagist.com url crawler
indeedbot
indexer
industry cortex webcrawler
indy library
indylabs_marius
inelabot
inet32 ctrl
inetbot
ineturl
info seeker
infolink
infomine
infonavirobot
informant
infoseek sidewinder
infotekies
infousabot
ingrid
inktomi
inoreader
insightscollector
insightsworksbot
inspirebot
instabid
insumascout
intelix
intelliseek
interget
internet ninja
internet radio crawler
internetlinkagent
internetseer
internetvista monitor
interseek
ioi
ipadd bot
ips-agent
ipselonbot
ip-web-crawler.com
iria
irlbot
iron33
isara
isearch
isilox
iskanie
istellabot
its-learning crawler
iu_csci_b659_class_crawler
ivia
jadynave
james bot
jamesbot
jbot
jbrofuzz
jemmathetourist
jennybot
jersey
jetbot
jetbrains omea pro
jetcar
jikespider
jim
jobboersebot
jobo
jobspider_ba
joc
joc web spider
joedog
jooblebot
joomla
jorgee
joyscapebot
jpg-newsbot
jspyda
junut bot
justview
jyxobot
k.s.bot
kakclebot
kalooga
katatudo-spider
kbeta1
keepni web site monitor
kenjin spider
kenjin.spider
keybot translation-search-machine
keywenbot
keyword density
keyword.density
kinjabot
kitenga-crawler-bot
kiwistatus
kmbot-
kmccrew bot search
k-meleon
knight
knowitall
knowledge engine
knowledge.com
kocmohabt
koepabot
komodiabot
koninklijke
korniki
kozmosbot
krowler
ksbot
kuloko-bot
kulturarw3
kummhttp
kurzor
kyluka crawl
l.webis
labhoo
labourunions411
lachesis
lament
lamerexterminator
lanshanbot
lapozzbot
larbin
laserlikebot
lbot
lbwappalyzer
leaptag
leechftp
leechget
letscrawl.com
lexibot
lexxebot
lftp
libcrawl
libiviacore
libw
libweb
libwhisker
lightspeedsystems
lightspeedsystemscrawler
likse
linguee bot
link checker
link validator
link_checker
linkalarm
linkbot
linkcheck by siteimprove.com
linkcheck scanner
linkchecker
linkdex.com
linkdexbot
linkextractorpro
linklint
linklooker
linkman
linkpadbot
links sql
linkscan
linksmanager
linksmanager.com_bot
linksweeper
linkwalker
linqiametadatadownloaderbot
linqiarssbot
linqiascrapebot
lipperhey
litefinder
litemage_walker
litlrbot
little grabber at skanktale.com
livelapbot
lm harvester
lmqueuebot
lmspider
lnspiderguy
loadtimebot
localcombot
locust
lolongbot
lookbot
lsearch
lssbot
lt scotland checklink
ltx71
lwp
lwp::simple
lwp-request
lwp-trivial
lycos_spider
lydia entity spider
lynnbot
lytranslate
magic browser
magnet
mag-net
magpie-crawler
magus bot
mainseek_bot
majestic12
mammoth
maneturlcheck
map robot
mappy
markmonitor
markwatch
masagool
masidani_bot_
mass downloader
masscan
mata hari
mata.hari
matentzn at cs dot man dot ac dot uk
matlab
mauibot
maxamine.com
maxomobot
mbcrawler
mcbot
meanpathbot
mechanize
mediawords
mediawords bot
medrabbit
megaindex.ru
megite
memacbot
memo
mendeleybot
mercator-
mercuryboard_user_agent_sql_injection.nasl
meta_bot
metacarta
metaeuro web search
metager2
metagloss
metajobbot
metal crawler
metaquerier
metaspider
metaspinner
metauri
mfc_tear_sample
mfcrawler
mfhttpscan
microsoft data access
microsoft url control
midown tool
miixpc
mindupbot
minibot
minirank
mini-robot
mirror
missigua locator
mister pix
mister.pix
miva
mixnode
mixrankbot
mj12bot
mnogosearch
mod_accessibility
moduna.com
moget
mojeek
mojeekbot
mojolicious
monkeycrawl
morfeus fucking scanner
moses
mowserbot
mqbot
mr.4x3
ms web services client protocol
mse360
msfrontpage
msie 6.0
msiecrawler
msindianwebcrawl
msmobot
msnptc
msrabot
msrbot
mt-soft
muhstik-scan
multitext
musobot
my_little_searchengine_project
myapp
mycompanybot
mycrawler
myengines-us-bot
myfamilybot
my-heritrix-crawler
myra
nabot
najdi.si
nambu
name intelligence
nameprotect
nasa search
natchcvs
nativehost
natweb-bad-link-mailer
naver
navroad
nearsite
nec-meshexplorer
needle
neofonie search:robot
neosciocrawler
nerdbynature.bot
nerdybot
nerima-crawl-
nessus
nestreader
net vampire
net::trackback
netants
netcarta cyberpilot pro
netcraft
netestate ne crawler
netexperts
netid.com bot
netlyzer
netmechanic
netnewswire
netobjects fusion
netpeakcheckerbot
netpeakspiderbot
netprospector
netresearchserver
netseer
netshift
netsongbot
netsparker
netspider
netsrcherp
nettrack
netvibes
netzip
newmedhunt
news bot
news_search_app
newsgatherer
newsgatoronline
newsgroupreporter
newstrovebot
nextgensearchbot
nextthing.org
nibbler
nicebot
nicerspro
niki-bot
nikto
nimblecrawler
nimbostratus-bot
nimbus-1
ninetowns
ninja
ninjabot
njuicebot
nlese
nmap
nmap scripting engine
node/simplecrawler
nogate
norbert the spider
noteworthybot
novomind ishop linkbot
npbot
nrcan intranet crawler
nsdl_search_bot
nsrbot
nu_tch
nuggetize.com bot
nusearch spider
nutch
nwspider
nymesis
nys-crawler
objectssearch
obot
obvius external linkcheck
ocelli
octopus
odp entries t_st
oegp
offline explorer
offline navigator
offline.explorer
ogspider
okhttp
omgili
omiexplorer_bot
omniexplorer
omnifind
omniweb
onalyticabot
onbbot
onetszukaj
online link validator
online-webceo-bot
onpagebot
onpageleadsbot
oozbot
openbot
openfind
openintelligencedata
openisearch
openlink virtuoso rdf crawler
openlinkprofiler
opensearchserver_bot
openvas
opidig
optidiscover
optimizer
oracle secure enterprise search
oracle ultra search
orangebot
orangespider
orisbot
ornl_crawler
ornl_mercury
osis-project.jp
oso
outclicksbot
outfoxbot
outfoxmelonbot
owler-bot
owsbot
ozelot
p3p client
page analyzer
page grabber
page scorer
page_verifier
pageanalyzer
pagebiteshyperbot
pagebull
pagedown
pagefetcher
pagegrabber
pagepeeker
pagerank monitor
pagescorer
pamsnbot.htm
panopy bot
panscient
panscient.com
pansophica
papa foto
paperlibot
parasite
parsijoo
pathtraq
pattern
patwebbot
pavuk
paxleframework
pbbot
pcbrowser
pcore-http
pd-crawler
pdfdrivecrawler
pecl::http
penthesila
peoplepal
perform_crawl
perman
personal ultimate crawler
phantomjs
photon
php version tracker
phpcrawl
phpdig
picosearch
picscout
picsearch
picturefinder
pieno robot
pierre smith
pimonster
pi-monster
pingdompagespeed
pinner-ios
pipbot
pipeliner
piplbot
pita
pixfinder
pixray
piyushbot
pk bot
planetwork bot search
pleasecrawl
plucker
plukkie
plumanalytics
plumtree
pocketparser
pockey
pocohttp
poe-component-client-http
pogodak.ba
pogodak.co.yu
poirot
polybot
pompos
poodle predictor
popscreenbot
postpost
privacyfinder
probethenet
projectwf-java-test-crawler
propowerbot
prowebwalker
proxem websearch
proximic
proxy crawler
psbot
pss-bot
psscanapp
psycheclone

Rule Path
Disallow /
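
The report therefore contains four groups: `*`, `yandex`, `googlebot`, and the long list of named agents that share the single rule `Disallow /`. The sketch below illustrates how a crawler could pick its group, assuming a case-insensitive exact match on the product token with a fallback to `*`. The `GROUPS` dictionary holds only a small excerpt of the rules listed above, and `select_rules` is a hypothetical helper, not the scanner's code.

```python
# Excerpt of the groups from this report; each value lists Disallow paths.
GROUPS = {
    "*": ["/a/", "/adv/", "/cgi-bin"],
    "googlebot": ["/a/", "/adv/", "/sitemap/"],
    "ahrefsbot": ["/"],  # one of the named agents that is blocked entirely
}


def select_rules(groups: dict, product_token: str) -> list:
    """Pick the group for a crawler, falling back to the '*' group."""
    return groups.get(product_token.lower(), groups.get("*", []))


print(select_rules(GROUPS, "AhrefsBot"))     # the blocked-agent group
print(select_rules(GROUPS, "Googlebot"))     # the dedicated googlebot group
print(select_rules(GROUPS, "SomeOtherBot"))  # falls back to '*'
```

Real crawlers may match tokens more loosely (for example by prefix or substring), so this exact-match lookup is only one possible reading.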

Warnings

  • 16 invalid lines.
  • `clean-param` is not a known field.
  • `host` is not a known field.
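
`Clean-param` and `Host` are extensions introduced by Yandex and are not part of the Robots Exclusion Protocol standard, which is the likely reason they are reported as unknown fields. The sketch below shows how a checker might sort lines into known, unknown and invalid. The `KNOWN_FIELDS` set and the function `classify_line` are assumptions for illustration; the scanner's actual field list and its definition of an invalid line are not given in this report, and the sample lines are invented rather than taken from the ipnews.ru file.

```python
# Assumed set of recognized fields; the scanner's real list is not documented here.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay"}


def classify_line(line: str):
    """Classify one robots.txt line as blank, invalid, known or unknown."""
    text = line.split("#", 1)[0].strip()  # drop comments and whitespace
    if not text:
        return ("blank", None)
    if ":" not in text:
        return ("invalid", None)  # no 'field: value' separator
    field = text.split(":", 1)[0].strip().lower()
    if field in KNOWN_FIELDS:
        return ("known", field)
    return ("unknown", field)


# Invented sample lines, for illustration only.
for sample in ["Disallow: /a/", "Host: example.com", "Clean-param: ref /path/", "oops"]:
    print(sample, "->", classify_line(sample))
```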