jep-cast.livejournal.com
robots.txt

Robots Exclusion Standard data for jep-cast.livejournal.com

Resource Scan

Scan Details

Site Domain jep-cast.livejournal.com
Base Domain livejournal.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2024-09-24T17:37:06+00:00
Next Scan 2024-10-08T17:37:06+00:00

Last Successful Scan

Scanned2024-09-02T17:20:24+00:00
URL https://jep-cast.livejournal.com/robots.txt
Domain IPs 81.19.74.0, 81.19.74.1
Response IP 81.19.74.0
Found Yes
Hash 404de49a40a77e096b5e9e9f469094def32f402fa261ce1adf5a8e5cb4a96e18
SimHash 6c44cad2f633

Groups

*

Rule Path
Allow /data/rss/
Disallow /*.html*mode%3Dreply
Disallow /*.html*replyto
Disallow /data/foaf/
Disallow /tag/
Disallow /friendstimes
Disallow /d0*
Disallow /d1*
Disallow /d2*
Disallow /d3*
Disallow /d4*
Disallow /d5*
Disallow /d6*
Disallow /d7*
Disallow /d8*
Disallow /d9*
Disallow /ratings/users
Disallow *.html?

mediapartners-google*

Rule Path
Allow /

yandex

Rule Path
Allow /
Disallow /data/foaf/
Disallow /ratings/users
Disallow /photo
Disallow *.html?

Other Records

Field Value
crawl-delay 100

googlebot

Rule Path
Allow /
Disallow /ratings/users
Disallow /data/foaf/
Disallow /photo
Disallow *.html?

spbot

Rule Path
Disallow /

ahrefsbot

Rule Path
Disallow /

Warnings

  • `clean-param` is not a known field.
  • `host` is not a known field.