bibliotekar.org
robots.txt

Robots Exclusion Standard data for bibliotekar.org

Resource Scan

Scan Details

Site Domain bibliotekar.org
Base Domain bibliotekar.org
Scan Status Ok
Last Scan2024-09-29T12:44:04+00:00
Next Scan 2024-10-06T12:44:04+00:00

Last Scan

Scanned2024-09-29T12:44:04+00:00
URL https://bibliotekar.org/robots.txt
Domain IPs 149.202.144.255
Response IP 149.202.144.255
Found Yes
Hash 539ee33ef6e96e794aa33a245349edf77a9a581e007ac2f2baa028a7ada02ea1
SimHash fd0dbc732535

Groups

*

Rule Path
Disallow /engine/go.php
Disallow /user/
Disallow /newposts/
Disallow /statistics.html
Disallow /*subaction%3Duserinfo
Disallow /*subaction%3Dnewposts
Disallow /*do%3Dlastcomments
Disallow /*do%3Dfeedback
Disallow /*do%3Dregister
Disallow /*do%3Dlostpassword
Disallow /*do%3Daddnews
Disallow /*do%3Dstats
Disallow /*do%3Dpm
Disallow /*do%3Dsearch
Disallow /*do%3Ddownload
Disallow /*doaction%3D
Disallow /*step%3D
Disallow /*do%3Dgo
Disallow /f/

Other Records

Field Value
sitemap https://knizhkin.org/sitemap.xml

Warnings

  • `clean-param` is not a known field.
  • `host` is not a known field.