ex-press.by
robots.txt

Robots Exclusion Standard data for ex-press.by

Resource Scan

Scan Details

Site Domain ex-press.by
Base Domain ex-press.by
Scan Status Failed
Failure StageFetching resource.
Failure ReasonCouldn't connect to server.
Last Scan2024-09-14T01:11:39+00:00
Next Scan 2024-12-13T01:11:39+00:00

Last Successful Scan

Scanned2021-09-28T15:08:10+00:00
URL http://ex-press.by/robots.txt
Redirect https://ex-press.live/robots.txt
Redirect Domain ex-press.live
Redirect Base ex-press.live
Found Yes
Hash 6e02dd2159b085b668364ffd01e9ce7fdca1ed1a9a3c8c7f7294aac4e64e1738
SimHash b2800dc56770

Groups

yandexnews

Rule Path
Allow /rss/yandex

Other Records

Field Value
crawl-delay 2

*

Rule Path
Allow /rss/zen

Other Records

Field Value
sitemap https://ex-press.by/sitemap.xml.gz

Comments

  • See http://www.robotstxt.org/robotstxt.html for documentation on how to use the robots.txt file
  • To ban all spiders from the entire site uncomment the next two lines:

Warnings

  • `host` is not a known field.