diary.space
robots.txt

Robots Exclusion Standard data for diary.space

Resource Scan

Scan Details

Site Domain diary.space
Base Domain diary.space
Scan Status Ok
Last Scan2025-07-22T05:08:33+00:00
Next Scan 2025-07-29T05:08:33+00:00

Last Scan

Scanned2025-07-22T05:08:33+00:00
URL https://diary.space/robots.txt
Domain IPs 104.21.21.220, 172.67.200.211, 2606:4700:3035::ac43:c8d3, 2606:4700:3036::6815:15dc
Response IP 104.21.21.220
Found Yes
Hash e49cfc1a809faedcf3038470676d02ff8c7b5ece7fd47700825c2111cc30f346
SimHash 49259c6d4d31

Groups

*

Rule Path
Allow /
Disallow /favorite/
Disallow /?from
Disallow /sms/
Disallow /u-mail/
Disallow /options/
Disallow /?sort
Disallow /?&sort
Disallow *%26ord
Disallow *%26l
Disallow *%26lsearch_opt
Disallow *%26act
Disallow *%26fullcommunity_moderatorslist
Disallow *%26fullcommunity_membershiplist
Disallow *%26fullfavoriteslist
Disallow *%26oam
Disallow /counter/
Disallow /designdir/
Disallow /?%2F*
Disallow /options/
Disallow /registration/
Disallow /photolib/
Disallow /diary.php
Disallow /counter/
Disallow /?favorite*
Disallow /?comments*
Disallow /tools/
Disallow *%26checknewreader
Disallow *?order=*
Disallow /members/?search_opt=*
Disallow /?first_post&from=*
Disallow /?last_post&from=*
Disallow /?post_next&postid=*
Disallow *?&last_post
Disallow /?tag=*
Disallow /discussion/
Disallow /?rfrom=*
Disallow *?all=true

Other Records

Field Value
sitemap https://diary.space/sitemap.xml

Warnings

  • `clean-param` is not a known field.
  • `host` is not a known field.