utro.media
robots.txt
Robots Exclusion Standard data for utro.media
Resource Scan
Scan Details
Site Domain | utro.media |
Base Domain | utro.media |
Scan Status | Ok |
Last Scan | 2024-11-11T06:57:00+00:00 |
Next Scan | 2024-11-18T06:57:00+00:00 |
Last Scan
Scanned | 2024-11-11T06:57:00+00:00 |
URL | https://utro.media/robots.txt |
Domain IPs | 95.213.212.85 |
Response IP | 95.213.212.85 |
Found | Yes |
Hash | b8a923e42f3b69687a8375134a224ab3cee29dcff614e1214d73b01a6e8238d4 |
SimHash | 6910aa61c7b1 |
Groups
*
Rule | Path |
---|---|
Disallow | /cgi-bin/ |
Disallow | /search |
Disallow | /preview |
Disallow | /forum/site_comments/ |
Other Records
Field | Value |
---|---|
crawl-delay | 1 |
Other Records
Field | Value |
---|---|
sitemap | https://utro.life/google_sitemap.xml |
Warnings
- `host` is not a known field.