tjas.org
robots.txt

Robots Exclusion Standard data for tjas.org

Resource Scan

Scan Details

Site Domain tjas.org
Base Domain tjas.org
Scan Status Ok
Last Scan2026-03-02T02:06:39+00:00
Next Scan 2026-04-01T02:06:39+00:00

Last Scan

Scanned2026-03-02T02:06:39+00:00
URL https://tjas.org/robots.txt
Redirect https://www.tjas.org/robots.txt
Redirect Domain www.tjas.org
Redirect Base tjas.org
Domain IPs 104.21.87.22, 172.67.139.191, 2606:4700:3031::6815:5716, 2606:4700:3031::ac43:8bbf
Redirect IPs 104.21.87.22, 172.67.139.191, 2606:4700:3031::6815:5716, 2606:4700:3031::ac43:8bbf
Response IP 104.21.87.22
Found Yes
Hash 67e99c3c9ba0d7cf2269e220b48026bace47920d744e8f7a8bd977df6675f466
SimHash 18390940c6d2

Groups

turnitinbot

Rule Path
Allow /

googlebot-scholar

Rule Path
Allow /

googlebot

Rule Path
Allow /

*

Rule Path
Allow /
Disallow /class/
Disallow /data/
Disallow /lib/
Disallow /request/
Disallow /tmp/
Disallow /main/
Disallow /mainn/
Disallow /template/
Disallow /view/

amazonbot

Rule Path
Disallow /

applebot-extended

Rule Path
Disallow /

bytespider

Rule Path
Disallow /

ccbot

Rule Path
Disallow /

claudebot

Rule Path
Disallow /

google-extended

Rule Path
Disallow /

gptbot

Rule Path
Disallow /

meta-externalagent

Rule Path
Disallow /

blexbot

Rule Path
Disallow /
Allow /data/ijos1/coversheet/

Other Records

Field Value
sitemap https://www.tjas.org/sitemap.xml

Comments

  • ==================================================
  • ACADEMIC & SEARCH ENGINE PERMISSIONS (PRIORITY)
  • ==================================================
  • ==================================================
  • RESTRICTIONS FOR AI & SCRAPERS
  • ==================================================
  • ==================================================
  • SITEMAP
  • ==================================================

Warnings

  • `content-signal` is not a known field.