ex-press.live
robots.txt

Robots Exclusion Standard data for ex-press.live

Resource Scan

Scan Details

Site Domain ex-press.live
Base Domain ex-press.live
Scan Status Ok
Last Scan2024-09-27T16:11:08+00:00
Next Scan 2024-10-04T16:11:08+00:00

Last Scan

Scanned2024-09-27T16:11:08+00:00
URL https://ex-press.live/robots.txt
Domain IPs 34.118.21.151
Response IP 34.118.21.151
Found Yes
Hash 6e02dd2159b085b668364ffd01e9ce7fdca1ed1a9a3c8c7f7294aac4e64e1738
SimHash b2800dc56770

Groups

yandexnews

Rule Path
Allow /rss/yandex

Other Records

Field Value
crawl-delay 2

*

Rule Path
Allow /rss/zen

Other Records

Field Value
sitemap https://ex-press.by/sitemap.xml.gz

Comments

  • See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines:

Warnings

  • `host` is not a known field.