kp.by
robots.txt
Robots Exclusion Standard data for kp.by
Resource Scan
Scan Details
Site Domain | kp.by |
Base Domain | kp.by |
Scan Status | Failed |
Failure Reason | Scan timed out. |
Last Scan | 2024-10-05T19:13:19+00:00 |
Next Scan | 2025-01-03T19:13:19+00:00 |
Last Successful Scan
Scanned | 2021-09-27T17:15:41+00:00 |
URL | https://kp.by/robots.txt |
Found | Yes |
Hash | 5d1e7c6ab85c1b9d1190dc56d290ead697eb7f758bb2e465701f4f1a175ff41e |
SimHash | 7901cf61cd1a |
Groups
*
Rule | Path |
---|---|
Disallow | /banners/ |
Disallow | /link/ |
Disallow | /scripts/ |
Disallow | /profile/ |
Disallow | /go/ |
Disallow | /print/ |
Disallow | /cgi-bin/ |
Disallow | /frames/ |
Disallow | /daily/article_cover/ |
Disallow | /best/fbs/ |
Disallow | /search/ |
Disallow | /*/wp-admin/* |
Disallow | /*/wp-json/* |
Disallow | /*/wp-login.php |
Disallow | /*/wp-register.php |
Disallow | /*/24-hours-rss/ |
Disallow | /comments/* |
Disallow | /content/api/* |
Disallow | /daily/26945/3996748/ |
Disallow | *couponRedirect |
Other Records
Field | Value |
---|---|
sitemap | https://www.kp.by/sitemap.xml.gz |
Warnings
- `clean-param` is not a known field.
- `host` is not a known field.