essoar.org
robots.txt

Robots Exclusion Standard data for essoar.org

Resource Scan

Scan Details

Site Domain essoar.org
Base Domain essoar.org
Scan Status Ok
Last Scan 2025-08-19T15:26:09+00:00
Next Scan 2025-09-18T15:26:09+00:00

Last Scan

Scanned 2025-08-19T15:26:09+00:00
URL https://essoar.org/robots.txt
Redirect https://www.authorea.com/robots.txt
Redirect Domain www.authorea.com
Redirect Base authorea.com
Domain IPs 104.21.63.4, 172.67.168.244
Redirect IPs 104.18.35.163, 172.64.152.93, 2606:4700:4400::6812:23a3, 2606:4700:4400::ac40:985d
Response IP 104.18.35.163
Found Yes
Hash 0433dd22e974636508cc6ae3e55e09ad1667e67952bc9309fabc231cdc348a2e
SimHash 3a2505710d17

Groups

*

Rule Path
Allow /
Disallow /oauth/*
Disallow /users/auth/*
Disallow /activate_accounts
Disallow /article_git_access_bridges
Disallow /launch_ipython
Disallow /resize_figure
Disallow /activate_temp_account
Disallow /citations
Disallow /password_resets
Disallow /sessions
Disallow /ace
Disallow /start_writing_now

anthropic-ai

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /
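As a rough illustration of how the groups above might be evaluated, here is a minimal sketch that assumes the longest-match semantics described in RFC 9309: the matching rule with the longest path wins, and Allow wins a tie. The helper names `pattern_to_regex` and `is_allowed` are hypothetical, the rule lists are abbreviated copies of the groups above, and a real crawler may interpret the file differently.

```python
import re

# Abbreviated copy of the "*" group listed above.
STAR_RULES = [
    ("allow", "/"),
    ("disallow", "/oauth/*"),
    ("disallow", "/users/auth/*"),
    ("disallow", "/sessions"),
    ("disallow", "/ace"),
]

# The anthropic-ai, ccbot, chatgpt-user, cohere-ai, google-extended
# and gptbot groups each contain this single rule.
BLOCKED_RULES = [("disallow", "/")]


def pattern_to_regex(pattern):
    """Translate a robots.txt path pattern into a compiled regex.

    '*' matches any run of characters; a trailing '$' anchors the end.
    Without '$', the pattern is a prefix match.
    """
    anchored = pattern.endswith("$")
    if anchored:
        pattern = pattern[:-1]
    body = ".*".join(re.escape(part) for part in pattern.split("*"))
    return re.compile(body + ("$" if anchored else ""))


def is_allowed(rules, path):
    """Return True if `path` may be fetched under `rules`.

    Assumed semantics: the longest matching pattern wins, Allow wins
    a tie, and a path that matches no rule is allowed.
    """
    best_len = -1
    allowed = True
    for kind, pattern in rules:
        if pattern_to_regex(pattern).match(path):
            length = len(pattern)
            if length > best_len or (length == best_len and kind == "allow"):
                best_len = length
                allowed = kind == "allow"
    return allowed


print(is_allowed(STAR_RULES, "/users/someone"))  # only "Allow /" matches
print(is_allowed(STAR_RULES, "/oauth/token"))    # "/oauth/*" is the longer match
print(is_allowed(BLOCKED_RULES, "/users/someone"))
```

Under these assumptions, a rule without a wildcard, such as `/sessions`, acts as a prefix, so it would also cover `/sessions/new`. A parser that applies the first matching rule instead of the longest one would treat the leading `Allow /` as overriding every Disallow in the `*` group.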

Other Records

Field Value
sitemap https://authorea.com/sitemap/sitemap.xml.gz

Comments

  • Welcome to Authorea!
  • Thank you for being a nice bot and playing by the rules.
  • If you are writing a paper based on crawled data from us, feel encouraged to write it in Authorea itself and share it with us!