news.rambler.ru
robots.txt

Robots Exclusion Standard data for news.rambler.ru

Resource Scan

Scan Details

Site Domain news.rambler.ru
Base Domain rambler.ru
Scan Status Ok
Last Scan2024-05-22T09:36:53+00:00
Next Scan 2024-06-05T09:36:53+00:00

Last Scan

Scanned2024-05-22T09:36:53+00:00
URL https://news.rambler.ru/robots.txt
Domain IPs 81.19.82.104, 81.19.82.105, 81.19.82.106
Response IP 81.19.82.104
Found Yes
Hash a6ece9e61963af6299da5b3151babb0d0dad3e7f3f4c3a7f4f7cf60f0304e6ac
SimHash 4e4832e1c7b3

Groups

yandex

Rule Path
Disallow /self/
Disallow /history/
Disallow /search
Disallow /set
Disallow /rss/*
Disallow /*/comments/*
Allow /rss/yandex/
Allow /rss/yandex/*
Disallow /*/items/
Disallow /sports/*
Disallow /google-video-sitemap.xml

googlebot

Rule Path
Allow /self/
Disallow /sports/*
Disallow /history/
Disallow /search
Disallow /set
Disallow /yandex-video-sitemap.xml
Disallow /*/items/
Disallow /*/comments/*

anews/1.0

Rule Path
Allow /rss/anews/
Disallow /self/
Disallow /sports/*
Disallow /*/comments/*

gptbot

Rule Path
Disallow /

shodan

Rule Path
Disallow /

*

Rule Path
Disallow /self/
Disallow /sports/*
Disallow /history/
Disallow /search
Disallow /set
Disallow /*/items/
Disallow /*/comments/*
Disallow /*?*show_deferred

Other Records

Field Value
sitemap https://news.rambler.ru/sitemap-index.xml

Warnings

  • `clean-param` is not a known field.
  • `host` is not a known field.