glavbusina.ru
robots.txt

Robots Exclusion Standard data for glavbusina.ru

Resource Scan

Scan Details

Site Domain glavbusina.ru
Base Domain glavbusina.ru
Scan Status Ok
Last Scan 2024-06-07T05:35:49+00:00
Next Scan 2024-07-07T05:35:49+00:00

Last Scan

Scanned 2024-06-07T05:35:49+00:00
URL https://glavbusina.ru/robots.txt
Domain IPs 92.38.128.246, 92.38.128.27
Response IP 92.38.128.27
Found Yes
Hash 9cb0bcf9e6d45a9c4c6245b8c6e1c85f8bb4f2a07b35f6497d29a61573896956
SimHash 82ec9dc7ee11

Groups

*

Rule Path
Disallow /auth/
Disallow /bitrix/
Disallow /local/
Disallow /login/
Disallow /personal/
Disallow /search/
Disallow /a/
Disallow /stat/
Disallow /news/
Disallow /gb/*
Disallow /index/
Disallow /load/
Disallow /online/
Disallow /photo
Disallow /shop/
Disallow /publ/
Disallow /reviews
Disallow /forum
Disallow /panel/
Disallow /admin/
Disallow /secure/
Disallow /informer/
Disallow /mchat
Disallow /abnl/
Disallow /google
Disallow /twitter
Disallow /facebook
Disallow /yandex/
Disallow /vkontakte
Disallow /temp
Disallow /users
Disallow /social/*
Disallow /include/*
Disallow /desctop_app/*
Disallow /tovary-so-skidkami2.php
Disallow /forma_obratnoy_svayzi
Disallow /lidery_prodazh
Disallow *?*
Disallow *PAGEN*
Disallow /catalogfoto
Disallow /store
Disallow /?ssid=
Disallow *catalog_filter_pf*
Disallow *SORT%3D*
Disallow *set_filter%3D*
Disallow *f_art%3D*
Disallow *f_other1%3D*
Disallow *f_other2%3D*
Disallow *f_other3%3D*
Disallow *f_other4%3D*
Disallow *f_other5%3D*
Disallow *f_other6%3D*
Disallow *f_other7%3D*
Disallow *f_other8%3D*
Disallow *f_other9%3D*
Disallow *.php$
Disallow *.pdf$
Disallow *.xls$
Disallow */index.php$
Disallow */404.php$

Other Records

Field Value
crawl-delay 0.2
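
Several paths in the group above use the `*` wildcard and the `$` end-of-path anchor (for example `*?*` and `*.php$`). The following is a minimal sketch of how a crawler might evaluate such patterns, assuming the common interpretation in which `*` matches any run of characters, `$` anchors the end of the path, and every other pattern is a prefix match. The helper names `pattern_to_regex` and `is_disallowed` and the sample paths are hypothetical, only a subset of the listed rules is used, and `Allow` precedence and percent-encoding normalization are deliberately left out.

```python
import re


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate one robots.txt path pattern into a compiled regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape each literal piece, then rejoin the pieces with '.*' for every '*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_disallowed(path: str, patterns: list[str]) -> bool:
    """Return True if any Disallow pattern matches the path (query included)."""
    return any(pattern_to_regex(p).match(path) for p in patterns)


# A subset of the Disallow paths listed for the `*` group.
RULES = ["/auth/", "/photo", "*?*", "*PAGEN*", "*.php$", "*.pdf$"]

# Hypothetical request paths, chosen only to exercise each kind of pattern.
for path in ["/auth/login", "/photography", "/catalog/?PAGEN_1=2",
             "/price.pdf", "/price.pdf.html", "/catalog/beads/"]:
    print(path, is_disallowed(path, RULES))
```

Under these assumptions, `/photo` also blocks `/photography` because it is a plain prefix, while `*.pdf$` blocks `/price.pdf` but not `/price.pdf.html`.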

yandex

Rule Path
Disallow /auth/
Disallow /bitrix/
Disallow /local/
Disallow /login/
Disallow /personal/
Disallow /search/
Disallow /a/
Disallow /stat/
Disallow /news/
Disallow /gb/*
Disallow /index/
Disallow /load/
Disallow /online/
Disallow /photo
Disallow /shop/
Disallow /publ/
Disallow /reviews
Disallow /forum
Disallow /panel/
Disallow /admin/
Disallow /secure/
Disallow /informer/
Disallow /mchat
Disallow /abnl/
Disallow /google
Disallow /twitter
Disallow /facebook
Disallow /yandex/
Disallow /vkontakte
Disallow /temp
Disallow /users
Disallow /social/*
Disallow /include/*
Disallow /desctop_app/*
Disallow /tovary-so-skidkami2.php
Disallow /forma_obratnoy_svayzi
Disallow /lidery_prodazh
Disallow *?*
Disallow *PAGEN*
Disallow /catalogfoto
Disallow /store
Disallow /?ssid=
Disallow *catalog_filter_pf*
Disallow *SORT%3D*
Disallow *set_filter%3D*
Disallow *f_art%3D*
Disallow *f_other1%3D*
Disallow *f_other2%3D*
Disallow *f_other3%3D*
Disallow *f_other4%3D*
Disallow *f_other5%3D*
Disallow *f_other6%3D*
Disallow *f_other7%3D*
Disallow *f_other8%3D*
Disallow *f_other9%3D*
Disallow *.php$
Disallow *.pdf$
Disallow *.xls$
Disallow */index.php$
Disallow */404.php$

ahrefsbot

Rule Path
Disallow /

betabot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

boardreader

Rule Path
Disallow /

bpimagewalker

Rule Path
Disallow /

checkmarknetwork

Rule Path
Disallow /

clockwork data vault

Rule Path
Disallow /

coccoc

Rule Path
Disallow /

crazywebcrawler

Rule Path
Disallow /

disco pump

Rule Path
Disallow /

domain re-animator

Rule Path
Disallow /

domainstatsbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

easouspider

Rule Path
Disallow /

ezooms

Rule Path
Disallow /

findxbot

Rule Path
Disallow /

getintent

Rule Path
Disallow /

gigabot

Rule Path
Disallow /

grammarly

Rule Path
Disallow /

guardcrwlr

Rule Path
Disallow /

hubspot links crawler

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /

istellabot

Rule Path
Disallow /

kocmohabt

Rule Path
Disallow /

linkdexbot

Rule Path
Disallow /

linkscrawler

Rule Path
Disallow /

linkspammer

Rule Path
Disallow /

linkwalker

Rule Path
Disallow /

ltx71

Rule Path
Disallow /

mail.ru_bot

Rule Path
Disallow /

mj12bot

Rule Path
Disallow /

netinfobot

Rule Path
Disallow /

nettrack

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

okhttp

Rule Path
Disallow /

openlinkprofiler

Rule Path
Disallow /

paperlibot

Rule Path
Disallow /

pingdom

Rule Path
Disallow /

proximic

Rule Path
Disallow /

psbot

Rule Path
Disallow /

pu_in

Rule Path
Disallow /

pulsepoint

Rule Path
Disallow /

qwantify

Rule Path
Disallow /

rankvalbot

Rule Path
Disallow /

salesintelligent

Rule Path
Disallow /

scrapy

Rule Path
Disallow /

searchie

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

seokicks-robot

Rule Path
Disallow /

socialrankiobot

Rule Path
Disallow /

sogou

Rule Path
Disallow /

sogou web spider

Rule Path
Disallow /

spbot

Rule Path
Disallow /

surveybot

Rule Path
Disallow /

teleport pro

Rule Path
Disallow /

twiceler

Rule Path
Disallow /

uptimebot

Rule Path
Disallow /

vegi bot

Rule Path
Disallow /

vericitecrawler

Rule Path
Disallow /

web-by-mail

Rule Path
Disallow /

webmeup-crawler

Rule Path
Disallow /

weborama-fetcher

Rule Path
Disallow /

websnake

Rule Path
Disallow /

webzip

Rule Path
Disallow /

wget

Rule Path
Disallow /

yisouspider

Rule Path
Disallow /

youdaobot

Rule Path
Disallow /

Other Records

Field Value
sitemap http://glavbusina.ru/sitemap.xml
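
The single-crawler groups above each carry one rule, `Disallow /`, which blocks the named crawler from the whole site, while crawlers without their own group fall back to the `*` group. A rough way to check this behaviour is Python's standard `urllib.robotparser`. The sketch below feeds it a small hand-written excerpt rather than the live file. The excerpt, the `can_crawl` helper, and the sample URLs are assumptions for illustration. Note that `urllib.robotparser` does plain prefix matching and does not interpret the `*` and `$` patterns used elsewhere in this file.

```python
from urllib.robotparser import RobotFileParser

# Hand-written excerpt modelled on the groups listed above (not the full file).
SAMPLE = """\
User-agent: *
Disallow: /auth/

User-agent: ahrefsbot
Disallow: /

User-agent: wget
Disallow: /
"""

parser = RobotFileParser()
parser.parse(SAMPLE.splitlines())


def can_crawl(agent: str, url: str) -> bool:
    """Hypothetical wrapper: ask the parser whether `agent` may fetch `url`."""
    return parser.can_fetch(agent, url)


# Crawlers with their own group are blocked everywhere.
print(can_crawl("AhrefsBot/7.0", "https://glavbusina.ru/"))
print(can_crawl("Wget/1.21", "https://glavbusina.ru/catalog/"))

# Any other crawler falls back to the `*` group.
print(can_crawl("Googlebot", "https://glavbusina.ru/catalog/"))
print(can_crawl("Googlebot", "https://glavbusina.ru/auth/"))
```

User-agent matching here is case-insensitive and ignores the version suffix after `/`, so `AhrefsBot/7.0` resolves to the `ahrefsbot` group.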

Comments

  • Disallow: /ct*
  • Allow: *?PAGEN_1=*
  • Disallow: /ct*
  • Allow: *?PAGEN_1=*
  • Disallow other crawlers made us CPU load

Warnings

  • 4 invalid lines.
  • `clean-param` is not a known field.