peerj.com
robots.txt

Robots Exclusion Standard data for peerj.com

Resource Scan

Scan Details

Site Domain peerj.com
Base Domain peerj.com
Scan Status Failed
Failure StageFetching resource.
Failure ReasonServer returned a client error.
Last Scan2025-11-15T19:58:11+00:00
Next Scan 2026-02-13T19:58:11+00:00

Last Successful Scan

Scanned2025-06-25T12:52:25+00:00
URL https://peerj.com/robots.txt
Domain IPs 104.18.39.203, 172.64.148.53, 2606:4700:4400::6812:27cb, 2606:4700:4400::ac40:9435
Response IP 104.18.39.203
Found Yes
Hash da70dd40abf2c7df188b2635530ae39b8bf0aed99776dc665329ec56c9ef8835
SimHash c24d5814c753

Groups

*

Rule Path
Disallow /ops/
Disallow /reviewer-match/*/
Disallow /errors/

Other Records

Field Value
crawl-delay 2

turnitin

Rule Path
Disallow /institutions/*/

Other Records

Field Value
crawl-delay 10

Other Records

Field Value
sitemap https://peerj.com/sitemap_index.xml

Warnings

  • 3 invalid lines.