kp.by
robots.txt

Robots Exclusion Standard data for kp.by

Resource Scan

Scan Details

Site Domain kp.by
Base Domain kp.by
Scan Status Failed
Failure ReasonScan timed out.
Last Scan2024-10-05T19:13:19+00:00
Next Scan 2025-01-03T19:13:19+00:00

Last Successful Scan

Scanned2021-09-27T17:15:41+00:00
URL https://kp.by/robots.txt
Found Yes
Hash 5d1e7c6ab85c1b9d1190dc56d290ead697eb7f758bb2e465701f4f1a175ff41e
SimHash 7901cf61cd1a

Groups

*

Rule Path
Disallow /banners/
Disallow /link/
Disallow /scripts/
Disallow /profile/
Disallow /go/
Disallow /print/
Disallow /cgi-bin/
Disallow /frames/
Disallow /daily/article_cover/
Disallow /best/fbs/
Disallow /search/
Disallow /*/wp-admin/*
Disallow /*/wp-json/*
Disallow /*/wp-login.php
Disallow /*/wp-register.php
Disallow /*/24-hours-rss/
Disallow /comments/*
Disallow /content/api/*
Disallow /daily/26945/3996748/
Disallow *couponRedirect

yandex

Rule Path
Disallow /afisha/*/amp/$
Disallow /*?*
Disallow *couponRedirect

Other Records

Field Value
sitemap https://www.kp.by/sitemap.xml.gz

Warnings

  • `clean-param` is not a known field.
  • `host` is not a known field.