Published in Vol 11 (2025)
Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/67244.

Journals
- Bolgova O, Ganguly P, Mavrych V. Comparative analysis of LLMs performance in medical embryology: A cross‐platform study of ChatGPT, Claude, Gemini, and Copilot. Anatomical Sciences Education 2025;18(7):718
- Mavrych V, Yousef E, Yaqinuddin A, Bolgova O. Large language models in medical education: a comparative cross-platform evaluation in answering histological questions. Medical Education Online 2025;30(1)
- Chytas D, Noussios G, Salmas M, Vasiliadis A, Troupis T. Investigation of Studies on ChatGPT's Ability to Answer Anatomy Questions: A Self-Evaluation by ChatGPT and Comparison with an Evaluation by Gemini. Cureus 2025
- Bolgova O, Ganguly P, Ikram M, Mavrych V. Evaluating large language models as graders of medical short answer questions: a comparative analysis with expert human graders. Medical Education Online 2025;30(1)
- Parente S, Rocha S, Moreira M, Oliveira-Filho A, Simeone D. Temporal trends of artificial intelligence in medical education: a global perspective. Discover Artificial Intelligence 2025;5(1)
- Ros-Arlanzón P, Gutarra-Ávila R, Arrarte-Esteban V, Bertomeu-González V, Hernández-Blasco L, Masiá M, Navarro-Canto L, Nieto-Navarro J, Abarca J, Sempere A. When AI models take the exam: large language models vs medical students on multiple-choice course exams. Medical Education Online 2025;30(1)
