essopenarchive.org
robots.txt

Robots Exclusion Standard data for essopenarchive.org

Resource Scan

Scan Details

Site Domain essopenarchive.org
Base Domain essopenarchive.org
Scan Status Ok
Last Scan2025-08-11T23:36:11+00:00
Next Scan 2025-09-10T23:36:11+00:00

Last Scan

Scanned2025-08-11T23:36:11+00:00
URL https://essopenarchive.org/robots.txt
Domain IPs 104.21.8.188, 172.67.157.208, 2606:4700:3033::ac43:9dd0, 2606:4700:3037::6815:8bc
Response IP 172.67.157.208
Found Yes
Hash 63e677a4816f95d9e9da4f7b992a063c857241c68590d712f3f9fd64cafa13f7
SimHash 382585710d17

Groups

*

Rule Path
Allow /
Disallow /oauth/*
Disallow /users/auth/*
Disallow /activate_accounts
Disallow /article_git_access_bridges
Disallow /launch_ipython
Disallow /resize_figure
Disallow /activate_temp_account
Disallow /citations
Disallow /password_resets
Disallow /sessions
Disallow /ace
Disallow /start_writing_now

anthropic-ai

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

chatgpt-user

Rule Path
Disallow /

cohere-ai

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

Other Records

Field Value
sitemap https://essopenarchive.org/essoar_sitemap/sitemap.xml.gz

Comments

  • Welcome to ESS Open Archive !
  • Thank you for being a nice bot and playing by the rules.
  • If you are writing a paper based on crawled data from us,
  • feel encouraged to write it in ESS Open Archive itself and share it with us!