news.rambler.ru
robots.txt

Robots Exclusion Standard data for news.rambler.ru

Resource Scan

Scan Details

Site Domain news.rambler.ru
Base Domain rambler.ru
Scan Status Ok
Last Scan 2024-11-06T17:41:04+00:00
Next Scan 2024-11-20T17:41:04+00:00

Last Scan

Scanned 2024-11-06T17:41:04+00:00
URL https://news.rambler.ru/robots.txt
Domain IPs 81.19.82.104, 81.19.82.105, 81.19.82.106
Response IP 81.19.82.104
Found Yes
Hash b9f8edbfe1898b2782ff14c7cda2d84d5352d8f8b55d6884289f87772606c88f
SimHash 4e4828a1c333
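
The Hash value above is 64 hexadecimal digits long, which is consistent with a SHA-256 digest of the fetched robots.txt body. The report does not name the algorithm, so treat that as an assumption. Under it, a change check between scans could be sketched as follows; `body_digest` and `has_changed` are hypothetical helper names, not part of the scanner.

```python
import hashlib


def body_digest(body: bytes) -> str:
    """Return the SHA-256 hex digest of a robots.txt body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


def has_changed(previous_hash: str, body: bytes) -> bool:
    """Compare a stored digest with the digest of a freshly fetched body."""
    return body_digest(body) != previous_hash.lower()


# Hypothetical body, not the real file served by news.rambler.ru.
sample = b"User-agent: *\nDisallow: /self/\n"
stored = body_digest(sample)
print(len(stored))                       # 64 hex digits, same length as the Hash field
print(has_changed(stored, sample))       # False: identical bytes
print(has_changed(stored, sample + b"Disallow: /set\n"))  # True: body differs
```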

Groups

yandex

Rule Path
Disallow /self/
Disallow /history/
Disallow /search
Disallow /set
Disallow /rss/*
Disallow /*/comments/*
Allow /rss/yandex/
Allow /rss/yandex/*
Disallow /*/items/
Disallow /sports/*
Disallow /google-video-sitemap.xml

googlebot

Rule Path
Allow /self/
Disallow /sports/*
Disallow /history/
Disallow /search
Disallow /set
Disallow /yandex-video-sitemap.xml
Disallow /*/items/
Disallow /*/comments/*

anews/1.0

Rule Path
Allow /rss/anews/
Disallow /self/
Disallow /sports/*
Disallow /*/comments/*

gptbot

Rule Path
Disallow /

shodan

Rule Path
Disallow /

*

Rule Path
Disallow /self/
Disallow /sports/*
Disallow /history/
Disallow /search
Disallow /set
Disallow /*/items/
Disallow /*/comments/*
Disallow /*?*show_deferred
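
The groups above mix plain prefixes (`/set`), wildcards (`/*/comments/*`) and Allow overrides (`/rss/yandex/` inside a disallowed `/rss/*`). A minimal sketch of how such rules could be evaluated, assuming RFC 9309 semantics (`*` matches any character sequence, a trailing `$` anchors the end, the longest matching pattern wins, Allow wins ties). Individual crawlers may resolve conflicts differently, and the function names here are hypothetical.

```python
import re


def pattern_to_regex(path_pattern: str) -> "re.Pattern[str]":
    """Translate a robots.txt path pattern into a compiled regex."""
    anchored = path_pattern.endswith("$")
    if anchored:
        path_pattern = path_pattern[:-1]
    # Escape literal parts and join them with ".*" where the pattern had "*".
    body = ".*".join(re.escape(part) for part in path_pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(rules, url_path: str) -> bool:
    """Apply longest-match-wins; with equal lengths, Allow beats Disallow."""
    best = None  # (pattern length, is_allow)
    for verb, pattern in rules:
        if pattern_to_regex(pattern).match(url_path):
            candidate = (len(pattern), verb == "allow")
            if best is None or candidate > best:
                best = candidate
    return True if best is None else best[1]


# Rules of the "*" group, copied from the table above.
DEFAULT_RULES = [
    ("disallow", "/self/"), ("disallow", "/sports/*"), ("disallow", "/history/"),
    ("disallow", "/search"), ("disallow", "/set"), ("disallow", "/*/items/"),
    ("disallow", "/*/comments/*"), ("disallow", "/*?*show_deferred"),
]

# Subset of the "yandex" group that shows the Allow override.
YANDEX_RSS_RULES = [
    ("disallow", "/rss/*"), ("allow", "/rss/yandex/"), ("allow", "/rss/yandex/*"),
]

# The URL paths below are made up for illustration.
print(is_allowed(DEFAULT_RULES, "/politics/article/"))        # True
print(is_allowed(DEFAULT_RULES, "/settings"))                 # False: "/set" is a prefix
print(is_allowed(DEFAULT_RULES, "/world/comments/1"))         # False
print(is_allowed(YANDEX_RSS_RULES, "/rss/yandex/latest.xml")) # True: longer Allow wins
print(is_allowed(YANDEX_RSS_RULES, "/rss/other.xml"))         # False
```

Note that under prefix matching `Disallow /set` also covers any path that merely begins with `/set`, such as the hypothetical `/settings` above.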

Other Records

Field Value
sitemap https://news.rambler.ru/sitemap-index.xml

Warnings

  • `clean-param` is not a known field.
  • `host` is not a known field.
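
Warnings like these typically arise when a parser checks each field name against a fixed list. A rough sketch of such a check, assuming the known set is `user-agent`, `allow`, `disallow` and `sitemap`; the scanner's actual list is not given in this report, and the sample input is invented rather than taken from the real file.

```python
# Assumed set of recognised fields; the scanner's real list is not documented here.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap"}


def find_unknown_fields(robots_txt: str) -> list:
    """Return unrecognised field names in order of first appearance."""
    unknown = []
    for raw_line in robots_txt.splitlines():
        line = raw_line.split("#", 1)[0].strip()  # drop comments and padding
        if not line or ":" not in line:
            continue
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in unknown:
            unknown.append(field)
    return unknown


# Hypothetical input for illustration only.
SAMPLE = """\
User-agent: *
Disallow: /self/
Clean-param: utm_source /
Host: example.com
Sitemap: https://example.com/sitemap.xml
"""

for name in find_unknown_fields(SAMPLE):
    print(f"`{name}` is not a known field.")
```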