apriatin.livejournal.com
robots.txt
Robots Exclusion Standard data for apriatin.livejournal.com
Resource Scan
Scan Details
Site Domain | apriatin.livejournal.com |
Base Domain | livejournal.com |
Scan Status | Failed |
Failure Stage | Fetching resource. |
Failure Reason | Server returned a client error. |
Last Scan | 2024-10-07T13:12:48+00:00 |
Next Scan | 2024-12-06T13:12:48+00:00 |
Last Successful Scan
Scanned | 2024-08-02T12:57:44+00:00 |
URL | https://apriatin.livejournal.com/robots.txt |
Domain IPs | 81.19.74.0, 81.19.74.1 |
Response IP | 81.19.74.1 |
Found | Yes |
Hash | 8d63e43d9f0cd48576208a1537511643a499e11782067904c84b8da4a35ceb13 |
SimHash | 6c44fad2d633 |
Groups
*
Rule | Path |
---|---|
Allow | /data/rss/ |
Disallow | /*.html*mode%3Dreply |
Disallow | /*.html*replyto |
Disallow | /data/foaf/ |
Disallow | /tag/ |
Disallow | /friendstimes |
Disallow | /d0* |
Disallow | /d1* |
Disallow | /d2* |
Disallow | /d3* |
Disallow | /d4* |
Disallow | /d5* |
Disallow | /d6* |
Disallow | /d7* |
Disallow | /d8* |
Disallow | /d9* |
Disallow | /ratings/users |
Warnings
- `clean-param` is not a known field.
- `host` is not a known field.