submit.biorxiv.org
robots.txt

Robots Exclusion Standard data for submit.biorxiv.org

Resource Scan

Scan Details

Site Domain submit.biorxiv.org
Base Domain biorxiv.org
Scan Status Ok
Last Scan 2025-07-31T09:07:07+00:00
Next Scan 2025-08-30T09:07:07+00:00

Last Scan

Scanned 2025-07-31T09:07:07+00:00
URL https://submit.biorxiv.org/robots.txt
Domain IPs 104.18.36.106, 172.64.151.150, 2606:4700:4400::6812:246a, 2606:4700:4400::ac40:9796
Response IP 172.64.151.150
Found Yes
Hash e45d9fa18b36fb484b291612f8d7aa8d299841de78b7707c0fc9e3bdd41e0e4c
SimHash eb17f8d4ae19
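
The Hash value above is 64 hexadecimal digits long, which matches the length of a SHA-256 digest. The scan does not name the algorithm, so SHA-256 is an assumption here. Under that assumption, a minimal sketch of how such a fingerprint could be used to detect a changed robots.txt between two scans might look like this. The function name `fingerprint` and the sample bodies are hypothetical and are not the real file contents.

```python
import hashlib


def fingerprint(body: bytes) -> str:
    """Return the hex SHA-256 digest of a robots.txt body (assumed algorithm)."""
    return hashlib.sha256(body).hexdigest()


# Hypothetical sample bodies standing in for two successive scans.
previous = fingerprint(b"User-agent: *\nDisallow: /cgi/\n")
current = fingerprint(b"User-agent: *\nDisallow: /cgi/\nDisallow: /ads/\n")

# Any byte-level difference in the body yields a different digest.
changed = previous != current
```

A digest of this kind only signals that the bytes differ; it says nothing about how much changed, which is presumably the role of the separate SimHash field.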

Groups

*

Rule Path
Disallow /accesslogs/
Disallow /ads/
Disallow /all.shtml
Disallow /apps/
Disallow /archive/
Disallow /backtocs/
Disallow /bmj/
Disallow /browse-alt.shtml
Disallow /browse.shtml
Disallow /browse/
Disallow /careerfocus/
Disallow /cgi/
Disallow /classifieds/
Disallow /collections/
Disallow /conf/
Disallow /content
Disallow /content/
Disallow /contents-by-date.0.shtml
Disallow /current.shtml
Disallow /feature/
Disallow /future.shtml
Disallow /future/
Disallow /guides/
Disallow /help/
Disallow /home/
Disallow /icons/
Disallow /include/
Disallow /index-alt.shtml
Disallow /jobalerts.shtml
Disallow /jobsearch.shtml
Disallow /math/
Disallow /minireviews.shtml
Disallow /nutinfo/
Disallow /older/
Disallow /pips/
Disallow /recruit/
Disallow /search.dtl
Disallow /search.shtml
Disallow /search/
Disallow /searchall/
Disallow /subscriptions/
Disallow /supplemental/
Disallow /tips/
Disallow /usage/
Disallow /trn.dtl

Other Records

Field Value
crawl-delay 10
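
The rules in the `*` group can be checked programmatically. The sketch below uses Python's standard `urllib.robotparser` with a small subset of the listed rules, rebuilt by hand into robots.txt syntax (the exact text of the live file is an assumption). The agent name `ExampleBot` and the tested URLs are hypothetical.

```python
from urllib.robotparser import RobotFileParser

# A subset of the "*" group, rewritten as robots.txt directives.
ROBOTS_TXT = """\
User-agent: *
Disallow: /cgi/
Disallow: /content
Disallow: /search/
Crawl-delay: 10
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

BASE = "https://submit.biorxiv.org"

# Disallow paths are matched as prefixes of the URL path.
blocked = parser.can_fetch("ExampleBot", BASE + "/content/early")
# "/content" is also a prefix of "/contents-by-date.0.shtml".
blocked_by_prefix = parser.can_fetch("ExampleBot", BASE + "/contents-by-date.0.shtml")
# A path that no rule prefixes is allowed.
allowed = parser.can_fetch("ExampleBot", BASE + "/submission")
# Seconds a compliant crawler should wait between requests.
delay = parser.crawl_delay("ExampleBot")
```

Because matching is by prefix, the single rule `Disallow /content` already covers both `/content/` and `/contents-by-date.0.shtml`, so those two entries in the list are redundant for a parser that behaves this way.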

fasterfox

Rule Path
Disallow /

daumoa

Rule Path
Disallow /

twiceler

Rule Path
Disallow /

dotbot

Rule Path
Disallow /

exabot

Rule Path
Disallow /

yandex

Rule Path
Disallow /
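
The named groups above (fasterfox, daumoa, twiceler, dotbot, exabot, yandex) each carry a single `Disallow /`, which excludes those crawlers from the whole site, while other agents fall back to the `*` group. A minimal sketch of that distinction with Python's standard `urllib.robotparser` follows. The robots.txt text is reconstructed from the tables and shortened, and `ExampleBot` is a hypothetical agent name.

```python
from urllib.robotparser import RobotFileParser

# Two of the named groups plus a shortened "*" group.
ROBOTS_TXT = """\
User-agent: dotbot
Disallow: /

User-agent: yandex
Disallow: /

User-agent: *
Disallow: /cgi/
"""

parser = RobotFileParser()
parser.parse(ROBOTS_TXT.splitlines())

ROOT = "https://submit.biorxiv.org/"

# Named agents match their own group and are blocked everywhere.
dotbot_allowed = parser.can_fetch("dotbot", ROOT)
yandex_allowed = parser.can_fetch("yandex", ROOT + "submission")
# An unlisted agent falls back to the "*" group.
other_root = parser.can_fetch("ExampleBot", ROOT)
other_cgi = parser.can_fetch("ExampleBot", ROOT + "cgi/query")
```

In this parser a named group takes precedence over `*`, so the path-specific rules of the `*` group never apply to the six listed crawlers.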