doctorlib.org
robots.txt

Robots Exclusion Standard data for doctorlib.org

Resource Scan

Scan Details

Site Domain doctorlib.org
Base Domain doctorlib.org
Scan Status Failed
Failure Stage Fetching resource.
Failure Reason Server returned a client error.
Last Scan 2025-03-17T09:55:08+00:00
Next Scan 2025-03-24T09:55:08+00:00

Last Successful Scan

Scanned 2025-03-09T09:54:03+00:00
URL https://doctorlib.org/robots.txt
Domain IPs 195.230.22.106
Response IP 195.230.22.106
Found Yes
Hash 2077ff72ff129a00b5207d7056427099eb6c103be4d128269cc9858fd8ac1ce1
SimHash 0b14844207b2

Groups

*

Rule Path
Disallow (empty)
Disallow /assets
Allow /assets/images
Allow /assets/css
Allow /assets/bower_components/fontawesome/

archive.org_bot

Rule Path
Disallow /

ia_archiver

Rule Path
Disallow /
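
To illustrate how a crawler might evaluate the groups listed above, here is a minimal Python sketch. It assumes longest-match precedence with Allow winning ties (the behaviour described in RFC 9309), treats the empty Disallow as matching nothing, and selects a group by exact, case-insensitive user-agent name. The names `GROUPS`, `rules_for`, and `is_allowed` are hypothetical, and wildcard patterns (`*`, `$`) are not handled.

```python
# Rules copied from the groups above, in order: (rule, path prefix).
GROUPS = {
    "*": [
        ("disallow", ""),  # empty path: assumed to match nothing
        ("disallow", "/assets"),
        ("allow", "/assets/images"),
        ("allow", "/assets/css"),
        ("allow", "/assets/bower_components/fontawesome/"),
    ],
    "archive.org_bot": [("disallow", "/")],
    "ia_archiver": [("disallow", "/")],
}


def rules_for(agent):
    """Pick the group for this agent, falling back to the '*' group.

    Simplification: exact name match only; real crawlers match on
    their product token.
    """
    return GROUPS.get(agent.lower(), GROUPS["*"])


def is_allowed(agent, path):
    """Return True if `agent` may fetch `path` under the rules above."""
    best_len = -1
    allowed = True  # no matching rule means the path is allowed
    for rule, prefix in rules_for(agent):
        if prefix == "" or not path.startswith(prefix):
            continue
        is_allow = rule == "allow"
        # The longest matching prefix wins; on a tie, Allow wins.
        if len(prefix) > best_len or (len(prefix) == best_len and is_allow):
            best_len = len(prefix)
            allowed = is_allow
    return allowed


if __name__ == "__main__":
    for agent, path in [
        ("somebot", "/assets/js/app.js"),
        ("somebot", "/assets/images/logo.png"),
        ("ia_archiver", "/"),
    ]:
        print(agent, path, is_allowed(agent, path))
```

Under these assumptions, a generic crawler is kept out of `/assets` except for the three allowed subpaths, while `archive.org_bot` and `ia_archiver` are blocked from the whole site.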

Other Records

Field Value
sitemap https://doctorlib.org/sitemap.xml

Warnings

  • `host` is not a known field.