jarmark.blesk.cz
robots.txt
Robots Exclusion Standard data for jarmark.blesk.cz
Resource Scan
Scan Details
Site Domain | jarmark.blesk.cz |
Base Domain | blesk.cz |
Scan Status | Ok |
Last Scan | 2024-11-15T22:58:04+00:00 |
Next Scan | 2024-11-22T22:58:04+00:00 |
Last Scan
Scanned | 2024-11-15T22:58:04+00:00 |
URL | https://jarmark.blesk.cz/robots.txt |
Domain IPs | 104.26.0.127, 104.26.1.127, 172.67.71.132, 2606:4700:20::681a:17f, 2606:4700:20::681a:7f, 2606:4700:20::ac43:4784 |
Response IP | 104.26.1.127 |
Found | Yes |
Hash | 79cd884d2bf928a4e2131cff302bde26c5be524956f9c04ca853352068a76c9b |
SimHash | 69151c60cd96 |
Groups
*
Rule | Path |
---|---|
Disallow | /search |
Disallow | /videoConfig/* |
Disallow | /captcha/* |
Disallow | /*?*nocache* |
Disallow | /*?*nocat* |
Disallow | /*?*__log* |
Disallow | /*?*mver* |
Disallow | /*.js* |
Disallow | /version.txt |
Other Records
Field | Value |
---|---|
sitemap | http://jarmark.blesk.cz/sitemap.xml |