apriatin.livejournal.com
robots.txt

Robots Exclusion Standard data for apriatin.livejournal.com

Resource Scan

Scan Details

Site Domain apriatin.livejournal.com
Base Domain livejournal.com
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2024-10-07T13:12:48+00:00
Next Scan 2024-12-06T13:12:48+00:00
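
The two timestamps above imply a rescan interval, which can be checked with a short sketch. The report does not state how the next scan is scheduled; the variable names below are illustrative only.

```python
from datetime import datetime, timedelta

# Timestamps copied verbatim from the Scan Details above.
last_scan = datetime.fromisoformat("2024-10-07T13:12:48+00:00")
next_scan = datetime.fromisoformat("2024-12-06T13:12:48+00:00")

# Gap between the failed scan and the scheduled retry.
interval = next_scan - last_scan
print(interval.days)
```

For this record the gap is exactly 60 days. Whether that is a fixed policy or a back-off after the failure is not something the report says.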

Last Successful Scan

Scanned 2024-08-02T12:57:44+00:00
URL https://apriatin.livejournal.com/robots.txt
Domain IPs 81.19.74.0, 81.19.74.1
Response IP 81.19.74.1
Found Yes
Hash 8d63e43d9f0cd48576208a1537511643a499e11782067904c84b8da4a35ceb13
SimHash 6c44fad2d633
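
The Hash value above is 64 hexadecimal characters, which is consistent with a SHA-256 digest of the fetched file. The report does not name the algorithm, so treat that as an assumption. Under it, a minimal sketch for checking whether a newly fetched robots.txt body matches the recorded hash could look like this (`body_digest` and `is_unchanged` are hypothetical helper names):

```python
import hashlib

# Hash copied from the Last Successful Scan record above.
RECORDED_HASH = "8d63e43d9f0cd48576208a1537511643a499e11782067904c84b8da4a35ceb13"

def body_digest(body: bytes) -> str:
    """Return the hex SHA-256 digest of a raw response body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()

def is_unchanged(body: bytes, recorded_hex: str = RECORDED_HASH) -> bool:
    """Compare a freshly fetched body against a recorded digest."""
    return body_digest(body) == recorded_hex.lower()
```

The digest has to be computed over the exact bytes the server returned. Any re-encoding or newline normalization before hashing would change the result.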

Groups

*

Rule Path
Allow /data/rss/
Disallow /*.html*mode%3Dreply
Disallow /*.html*replyto
Disallow /data/foaf/
Disallow /tag/
Disallow /friendstimes
Disallow /d0*
Disallow /d1*
Disallow /d2*
Disallow /d3*
Disallow /d4*
Disallow /d5*
Disallow /d6*
Disallow /d7*
Disallow /d8*
Disallow /d9*
Disallow /ratings/users

mediapartners-google*

Rule Path
Allow /

yandex

Rule Path
Allow /
Disallow /data/foaf/
Disallow /ratings/users
Disallow /photo

Other Records

Field Value
crawl-delay 100

googlebot

Rule Path
Allow /
Disallow /ratings/users
Disallow /data/foaf/
Disallow /photo

spbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /
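
Several rules in the `*` group use the `*` wildcard, and the group mixes Allow and Disallow. The sketch below shows one way a crawler might evaluate such a group, following the longest-match rule of RFC 9309 (the most specific matching pattern wins, Allow wins a tie, and no match means allowed). It is a simplified illustration, not the scanner's own logic: it does not normalize percent-encoding, and `pattern_to_regex` and `is_allowed` are hypothetical names.

```python
import re

# Rules copied from the "*" group listed above.
STAR_GROUP = [
    ("allow", "/data/rss/"),
    ("disallow", "/*.html*mode%3Dreply"),
    ("disallow", "/*.html*replyto"),
    ("disallow", "/data/foaf/"),
    ("disallow", "/tag/"),
    ("disallow", "/friendstimes"),
    ("disallow", "/d0*"), ("disallow", "/d1*"), ("disallow", "/d2*"),
    ("disallow", "/d3*"), ("disallow", "/d4*"), ("disallow", "/d5*"),
    ("disallow", "/d6*"), ("disallow", "/d7*"), ("disallow", "/d8*"),
    ("disallow", "/d9*"),
    ("disallow", "/ratings/users"),
]

def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a regex anchored at the start."""
    anchored_end = pattern.endswith("$")
    if anchored_end:
        pattern = pattern[:-1]
    # "*" matches any run of characters; everything else is literal.
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored_end else ""))

def is_allowed(rules, path):
    """Apply longest-match precedence; an unmatched path is allowed."""
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            # Longer pattern wins; on equal length, Allow wins.
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = (kind == "allow")
    return allowed

for path in ["/data/rss/", "/tag/python", "/12345.html?replyto=99", "/profile"]:
    print(path, is_allowed(STAR_GROUP, path))
```

A crawler would first pick the group whose user-agent line matches it best (for example `googlebot` or `yandex` above) and fall back to `*` only when no named group applies.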

Warnings

  • `clean-param` is not a known field.
  • `host` is not a known field.
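
Warnings like the two above can be produced by comparing each field name in the file against a list of recognized fields. The sketch below illustrates the idea. The `KNOWN_FIELDS` set is an assumption (the scanner's actual list is not shown), and the sample file contents are hypothetical, since the report does not reproduce the original `clean-param` and `host` lines.

```python
# Assumed set of recognized field names; the scanner's real list is not given.
KNOWN_FIELDS = {"user-agent", "allow", "disallow", "sitemap", "crawl-delay"}

def unknown_fields(robots_txt):
    """Return unrecognized field names in order of first appearance."""
    seen = []
    for raw in robots_txt.splitlines():
        line = raw.split("#", 1)[0].strip()  # drop comments and whitespace
        if ":" not in line:
            continue  # blank or malformed line
        field = line.split(":", 1)[0].strip().lower()
        if field not in KNOWN_FIELDS and field not in seen:
            seen.append(field)
    return seen

# Hypothetical sample; the values are placeholders, not the real file contents.
SAMPLE = """\
User-agent: *
Disallow: /tag/
Clean-param: example_param /
Host: example.com
"""

for name in unknown_fields(SAMPLE):
    print(f"`{name}` is not a known field.")
```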