1areal.ru
robots.txt

Robots Exclusion Standard data for 1areal.ru

Resource Scan

Scan Details

Site Domain 1areal.ru
Base Domain 1areal.ru
Scan Status Ok
Last Scan 2024-10-02T04:41:32+00:00
Next Scan 2024-10-09T04:41:32+00:00

Last Scan

Scanned 2024-10-02T04:41:32+00:00
URL https://1areal.ru/robots.txt
Redirect https://www.1areal.ru/robots.txt
Redirect Domain www.1areal.ru
Redirect Base 1areal.ru
Domain IPs 5.23.51.236
Redirect IPs 5.23.51.236
Response IP 5.23.51.236
Found Yes
Hash cb004a30e51b979af771701ea3da361f4162f926e458584a0797fccbb8ba22bb
SimHash ffb0515bf711
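
The Hash value above is 64 hexadecimal characters long, which is the length of a SHA-256 digest. The scan does not say which algorithm produced it, so SHA-256 is an assumption here. A minimal sketch of how such a digest could be used to notice that a robots.txt body changed between scans follows; the function names `body_digest` and `has_changed` are hypothetical.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the SHA-256 digest of a robots.txt body as lowercase hex.

    Assumption: the scanner hashes the raw response bytes with SHA-256.
    The report itself does not confirm this.
    """
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_digest: str, body: bytes) -> bool:
    """Compare a stored digest with the digest of a freshly fetched body."""
    return body_digest(body) != previous_digest


# Illustrative bodies only, not the real contents of the scanned file.
old_body = b"User-agent: *\nDisallow: /tmp\n"
new_body = b"User-agent: *\nDisallow: /tmp\nDisallow: /ajax\n"
stored = body_digest(old_body)
```

With these sample bodies, `has_changed(stored, old_body)` is `False` and `has_changed(stored, new_body)` is `True`. An exact digest only signals that something changed; it says nothing about how much.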

Groups

yandex

Rule Path
Allow /
Allow /pages
Disallow /profile
Disallow /tmp
Disallow /*?$
Disallow /*addob?
Disallow /*editob
Disallow /*delete
Disallow /*page
Disallow /ajax
Disallow /*?_openstat
Disallow /*?utm_*=
Disallow /*%26utm_*%3D
Disallow /*sort%3D
Disallow /*order%3D
Disallow /*map%3D
Disallow /*fy%3D
Disallow /*pols%3D
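
Several of the paths above use the two special characters common in robots.txt rules: `*` stands for any run of characters, and a trailing `$` anchors the pattern to the end of the URL. The sketch below shows one way to evaluate a subset of these rules, assuming the longest-match precedence described in RFC 9309 (the longest matching pattern wins, and Allow wins a tie). Individual crawlers may differ in detail, no percent-encoding normalization is done, and the names `pattern_to_regex`, `is_allowed` and `RULES` are hypothetical.

```python
import re

# A subset of the rules listed above, as (kind, path pattern) pairs.
RULES = [
    ("allow", "/"),
    ("allow", "/pages"),
    ("disallow", "/profile"),
    ("disallow", "/tmp"),
    ("disallow", "/*?$"),
    ("disallow", "/*page"),
    ("disallow", "/ajax"),
    ("disallow", "/*?utm_*="),
]


def pattern_to_regex(pattern: str) -> re.Pattern:
    """Translate a robots.txt path pattern into an anchored regex."""
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    # Escape literal text and turn each '*' into '.*'.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile("^" + body + ("$" if anchored else ""))


def is_allowed(path: str, rules=RULES) -> bool:
    """Apply longest-match precedence; Allow wins when lengths tie."""
    best = None  # (pattern length, is_allow)
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            candidate = (len(pattern), kind == "allow")
            if best is None or candidate > best:
                best = candidate
    # No matching rule means the path is allowed by default.
    return True if best is None else best[1]
```

Under these assumptions, `is_allowed("/catalog/flat")` is `True`, while `is_allowed("/profile/1")`, `is_allowed("/search?")` and `is_allowed("/catalog?page=2")` are all `False`. Note that `/pages` matches both `Allow /pages` and `Disallow /*page` at the same length, so the tie rule decides it and the path comes out allowed.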

Other Records

Field Value
crawl-delay 5
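
The `crawl-delay` record asks a crawler to pause between requests; it is not part of RFC 9309, and support varies by crawler. A minimal sketch of a client that honours the value of 5 by treating it as a minimum number of seconds between requests is shown below. The class name `CrawlDelayThrottle` is hypothetical, and the clock and sleep functions are injectable only so the behaviour can be checked without real waiting.

```python
import time


class CrawlDelayThrottle:
    """Enforce a minimum interval between consecutive requests."""

    def __init__(self, delay_seconds: float, clock=time.monotonic, sleep=time.sleep):
        self._delay = float(delay_seconds)
        self._clock = clock
        self._sleep = sleep
        self._last = None  # time of the previous request, if any

    def wait(self) -> float:
        """Block until the delay has elapsed; return the seconds slept."""
        pause = 0.0
        if self._last is not None:
            elapsed = self._clock() - self._last
            pause = max(0.0, self._delay - elapsed)
        if pause > 0:
            self._sleep(pause)
        self._last = self._clock()
        return pause
```

A crawler would call `throttle.wait()` immediately before each request to the site, with `throttle = CrawlDelayThrottle(5)`. The first call returns at once, and later calls sleep only for whatever part of the 5 seconds has not already passed.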

*

Rule Path
Allow /
Allow /pages
Disallow /profile
Disallow /tmp
Disallow /*?$
Disallow /*addob?
Disallow /*editob
Disallow /*delete
Disallow /*page
Disallow /ajax
Disallow /*?_openstat
Disallow /*?utm_*=
Disallow /*%26utm_*%3D
Disallow /*sort%3D
Disallow /*order%3D
Disallow /*map%3D
Disallow /*fy%3D
Disallow /*pols%3D

ubicrawler

Rule Path
Disallow /

doc

Rule Path
Disallow /

zao

Rule Path
Disallow /

sitecheck.internetseer.com

Rule Path
Disallow /

zealbot

Rule Path
Disallow /

sitesnagger

Rule Path
Disallow /

webstripper

Rule Path
Disallow /

webcopier

Rule Path
Disallow /

fetch

Rule Path
Disallow /

offline explorer

Rule Path
Disallow /

teleport

Rule Path
Disallow /

teleportpro

Rule Path
Disallow /

webzip

Rule Path
Disallow /

linko

Rule Path
Disallow /

httrack

Rule Path
Disallow /

xenu

Rule Path
Disallow /

larbin

Rule Path
Disallow /

libwww

Rule Path
Disallow /

zyborg

Rule Path
Disallow /

download ninja

Rule Path
Disallow /

grub-client

Rule Path
Disallow /

k2spider

Rule Path
Disallow /

npbot

Rule Path
Disallow /

webreaper

Rule Path
Disallow /

psbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

speedy

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

blexbot

Rule Path
Disallow /

blexbot/1.0

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

bloglines/3.1

Rule Path
Disallow /

jyxobot/1

Rule Path
Disallow /

cityreview

Rule Path
Disallow /

proximic

Rule Path
Disallow /

semrushbot

Rule Path
Disallow /

semrushbot-sa

Rule Path
Disallow /

semrushbot-ba

Rule Path
Disallow /

semrushbot-si

Rule Path
Disallow /

semrushbot-swa

Rule Path
Disallow /

semrushbot-ct

Rule Path
Disallow /

semrushbot-bm

Rule Path
Disallow /

amazonbot/0.1

Rule Path
Disallow /

Other Records

Field Value
sitemap https://www.1areal.ru/sitemap/index.txt
sitemap https://www.1areal.ru/sitemap/search.txt
sitemap https://www.1areal.ru/sitemap/catalog.txt
sitemap https://www.1areal.ru/sitemap/index.xml
sitemap https://www.1areal.ru/sitemap/index.txt
sitemap https://www.1areal.ru/sitemap/search.txt
sitemap https://www.1areal.ru/sitemap/catalog.txt
sitemap https://www.1areal.ru/sitemap/index.xml
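
Sitemap records apply to the whole file rather than to one group, and the same URL can be listed more than once, as in the table above. A minimal sketch of collecting them from raw robots.txt text while dropping repeats and keeping first-seen order follows. The function name `extract_sitemaps` is hypothetical and the sample text is illustrative, not the scanned file.

```python
def extract_sitemaps(robots_txt: str) -> list:
    """Return unique Sitemap URLs in the order they first appear."""
    seen = {}  # dict preserves insertion order
    for raw_line in robots_txt.splitlines():
        # Drop trailing comments and surrounding whitespace.
        line = raw_line.split("#", 1)[0].strip()
        field, sep, value = line.partition(":")
        # Field names are case-insensitive; the value keeps its own colons.
        if sep and field.strip().lower() == "sitemap":
            url = value.strip()
            if url:
                seen.setdefault(url, None)
    return list(seen)


sample = """\
User-agent: Yandex
Disallow: /tmp
Sitemap: https://www.1areal.ru/sitemap/index.txt
Sitemap: https://www.1areal.ru/sitemap/index.xml

User-agent: *
Disallow: /tmp
Sitemap: https://www.1areal.ru/sitemap/index.txt
Sitemap: https://www.1areal.ru/sitemap/index.xml
"""
```

For this sample, `extract_sitemaps(sample)` returns the two distinct URLs, `index.txt` first and `index.xml` second.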

Comments

  • STOP!!! BOTS
  • Crawlers that are kind enough to obey, but which we'd rather not have unless they're feeding search engines.
  • Some bots are known to be trouble, particularly those designed to copy entire sites. Please obey robots.txt.
  • The 'grub' distributed client has been *very* poorly behaved.
  • Doesn't follow robots.txt anyway, but...
  • Hits many times per second, not acceptable
  • http://www.nameprotect.com/botinfo.html
  • A capture bot, downloads gazillions of pages with no public benefit
  • http://www.webreaper.net/

Warnings

  • `host` is not a known field.