ria.ru
robots.txt

Robots Exclusion Standard data for ria.ru

Resource Scan

Scan Details

Site Domain ria.ru
Base Domain ria.ru
Scan Status Ok
Last Scan2024-05-11T02:10:10+00:00
Next Scan 2024-05-18T02:10:10+00:00

Last Scan

Scanned2024-05-11T02:10:10+00:00
URL https://ria.ru/robots.txt
Domain IPs 178.248.234.228
Response IP 178.248.234.228
Found Yes
Hash 69035939acd75bf7c01085c649faa49a717e6e5c1a126f842ec3afb642f89723
SimHash 6db8fd818ab3

Groups

*

Rule Path
Disallow *-print.html$
Disallow /sys_*
Disallow /valdai/
Disallow /search/
Disallow /ria_70*
Disallow /test_adfox*
Disallow /ecokarta*
Disallow /id/
Disallow */embed/
Disallow /specialprojects/
Disallow /_editorial_preview_*
Disallow /amp/*/more.html*
Disallow /?*
Disallow /services/

Other Records

Field Value
sitemap https://ria.ru/sitemap_article_index.xml
sitemap https://ria.ru/sitemap_list_index.xml
sitemap https://ria.ru/sitemap_archive.xml

Warnings

  • `clean-param` is not a known field.