tarbija24.ee
robots.txt
Robots Exclusion Standard data for tarbija24.ee
Resource Scan
Scan Details
Site Domain | tarbija24.ee |
Base Domain | tarbija24.ee |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Couldn't connect to server. |
Last Scan | 2024-11-09T19:41:38+00:00 |
Next Scan | 2024-12-09T19:41:38+00:00 |
Last Successful Scan
Scanned | 2024-10-11T19:39:19+00:00 |
URL | http://tarbija24.ee/robots.txt |
Domain IPs | 185.154.221.183, 185.154.221.184 |
Response IP | 185.154.221.184 |
Found | Yes |
Hash | bf96e42f81af4069cefff42badf728184a8375b1776c0ad3d4589453a4133cde |
SimHash | 2b0f4660c9b0 |
Groups
*
Rule | Path |
---|---|
Disallow | /search* |
Disallow | /latest/* |
Disallow | /*/print/* |
Disallow | /print/* |
Disallow | /*/com/* |
Disallow | /mobile/* |
Disallow | /rest/* |
Disallow | /feed/* |
Disallow | /weather/* |
Disallow | /?schedule=* |
Disallow | /author/* |
bingbot
msnbot
msnbot-media
yandexbot
ahrefsbot
seekportbot
Rule | Path |
---|---|
Disallow | /search* |
Disallow | /latest/* |
Disallow | /*/print/* |
Disallow | /print/* |
Disallow | /*/com/* |
Disallow | /mobile/* |
Disallow | /?schedule=* |
Disallow | /rest/* |
Disallow | /feed/* |
Disallow | /weather/* |
Disallow | /author/* |
Other Records
Field | Value |
---|---|
crawl-delay | 60 |
Other Records
Field | Value |
---|---|
sitemap | https://tarbija24.ee/sitemap |