Published on in Vol 11 (2025)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/67244, first published .
Large Language Models in Biochemistry Education: Comparative Evaluation of Performance


Journals

  1. Bolgova O, Ganguly P, Mavrych V. Comparative analysis of LLMs performance in medical embryology: A cross‐platform study of ChatGPT, Claude, Gemini, and Copilot. Anatomical Sciences Education 2025;18(7):718
  2. Mavrych V, Yousef E, Yaqinuddin A, Bolgova O. Large language models in medical education: a comparative cross-platform evaluation in answering histological questions. Medical Education Online 2025;30(1)
  3. Chytas D, Noussios G, Salmas M, Vasiliadis A, Troupis T. Investigation of Studies on ChatGPT's Ability to Answer Anatomy Questions: A Self-Evaluation by ChatGPT and Comparison with an Evaluation by Gemini. Cureus 2025
  4. Bolgova O, Ganguly P, Ikram M, Mavrych V. Evaluating large language models as graders of medical short answer questions: a comparative analysis with expert human graders. Medical Education Online 2025;30(1)
  5. Parente S, Rocha S, Moreira M, Oliveira-Filho A, Simeone D. Temporal trends of artificial intelligence in medical education: a global perspective. Discover Artificial Intelligence 2025;5(1)
  6. Ros-Arlanzón P, Gutarra-Ávila R, Arrarte-Esteban V, Bertomeu-González V, Hernández-Blasco L, Masiá M, Navarro-Canto L, Nieto-Navarro J, Abarca J, Sempere A. When AI models take the exam: large language models vs medical students on multiple-choice course exams. Medical Education Online 2025;30(1)