Published in Vol 10 (2024)
Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/48514.

Journals
- Chan J, Dong T, Angelini G. The performance of large language models in intercollegiate Membership of the Royal College of Surgeons examination. The Annals of The Royal College of Surgeons of England 2024;106(8):700
- Wang Y, Chen Y, Sheng J. Assessing ChatGPT as a Medical Consultation Assistant for Chronic Hepatitis B: Cross-Language Study of English and Chinese. JMIR Medical Informatics 2024;12:e56426
- Fan K, Fan K. Dermatological Knowledge and Image Analysis Performance of Large Language Models Based on Specialty Certificate Examination in Dermatology. Dermato 2024;4(4):124
- Ho C, Tian T, Ayers A, Aaron R, Phillips V, Wolf R, Mathioudakis N, Dai T, Klonoff D. Qualitative metrics from the biomedical literature for evaluating large language models in clinical decision-making: a narrative review. BMC Medical Informatics and Decision Making 2024;24(1)
- Abhari S, Afshari Y, Fatehi F, Salmani H, Garavand A, Chumachenko D, Zakerabasali S, Morita P. Exploring ChatGPT in clinical inquiry: a scoping review of characteristics, applications, challenges, and evaluation. Annals of Medicine & Surgery 2024;86(12):7094
- Burisch C, Bellary A, Breuckmann F, Ehlers J, Thal S, Sellmann T, Gödde D. ChatGPT-4 Performance on German Continuing Medical Education—Friend or Foe (Trick or Treat)? Protocol for a Randomized Controlled Trial. JMIR Research Protocols 2025;14:e63887
- Temiz M, Güzel C. Assessing the Performance of ChatGPT on Dentistry Specialization Exam Questions: A Comparative Study with DUS Examinees. Medical Records 2025;7(1):162
- Bany Abdelnabi A, Soykan B, Bhatti D, Rabadi G. Usefulness of Large Language Models (LLMs) for Student Feedback on H&P During Clerkship: Artificial Intelligence for Personalized Learning. ACM Transactions on Computing for Healthcare 2025
- Xiong Y, Zhan Z, Zhong C, Zeng W, Guo J, Tang W, Liu C. Evaluating the Performance of Large Language Models (LLMs) in Answering and Analysing the Chinese Dental Licensing Examination. European Journal of Dental Education 2025;29(2):332
- Cailhol L, Durpoix A, Poirier S, Prada P, Salles J, Slovak S, Yrondi A. Impact d’une formation francophone, internationale et en ligne sur le trouble de la personnalité. Annales Médico-psychologiques, revue psychiatrique 2025;183(7):703
- Yang C, Chen Y, Qian C, Shi F, Guo Y. The data-intensive research paradigm: challenges and responses in clinical professional graduate education. Frontiers in Medicine 2025;12
- Huang H, Shu S. The effectiveness of ChatGPT in pediatric simulation-based tests of nursing courses in Taiwan: A descriptive study. Clinical Simulation in Nursing 2025;102:101732
- Elkin P, Mehta G, LeHouillier F, Resnick M, Mullin S, Tomlin C, Resendez S, Liu J, Nebeker J, Brown S. Semantic Clinical Artificial Intelligence vs Native Large Language Model Performance on the USMLE. JAMA Network Open 2025;8(4):e256359
- Federico L, Fusaro D, Coppola G, Gregori M, Durante S. Application of ChatGPT 4.0 in radiological dose management: Perceptions of radiographers with varying expertise. Radiography 2025;31(4):102972
- Li Z, Yan C, Cao Y, Gong A, Li F, Zeng R. Evaluating performance of large language models for atrial fibrillation management using different prompting strategies and languages. Scientific Reports 2025;15(1)
- La Bella S, Bayraktar D, Porreca A, Li L, Attanasi M, Aliyev E, Migowa A, Scott C, Thakare D, Bayindir Y, Consolaro A, Feldman B, Ozen S. Global variations in artificial intelligence-generated information on juvenile idiopathic arthritis. Rheumatology 2025
- Arslan S, Usta Küçükbezirci G. Evaluating the Accuracy, Completeness, and Readability of Chatbot Responses to Refractive Surgery-Related Patient Questions: A Comparative Analysis of ChatGPT and Google Gemini. Cureus 2025
- Liu Y, Yu F, Zhang X, Tong X, Li K, Gu W, Yu B. Assessing the Role of Large Language Models Between ChatGPT and DeepSeek in Asthma Education for Bilingual Individuals: Comparative Study. JMIR Medical Informatics 2025;13:e65365
- Wu R, Zong H, Wu E, Li J, Zhou Y, Zhang C, Zhang Y, Wang J, Tang T, Shen B. Improving large language models for miRNA information extraction via prompt engineering. Computer Methods and Programs in Biomedicine 2025;271:109033
- Tian L, Lu Y, Fei X, Lu J. Intelligent Head and Neck CTA Report Quality Detection with Large Language Models. Journal of Imaging Informatics in Medicine 2025
- Lin Y, Luo Z, Ye Z, Zhong N, Zhao L, Zhang L, Li X, Chen Z, Chen Y. Applications, Challenges, and Prospects of Generative Artificial Intelligence Empowering Medical Education: Scoping Review. JMIR Medical Education 2025;11:e71125
- Jaleel A, Aziz U, Farid G, Zahid Bashir M, Mirza T, Khizar Abbas S, Aslam S, Sikander R. Evaluating the Potential and Accuracy of ChatGPT-3.5 and 4.0 in Medical Licensing and In-Training Examinations: Systematic Review and Meta-Analysis. JMIR Medical Education 2025;11:e68070
Books/Policy Documents
- Mengfan Z, Osman K. 2024 Yearbook Emerging Technologies in Learning.
Conference Proceedings
- Huang J, Wei Y, Zhang L, Chen W. 2024 International Symposium on Educational Technology (ISET). Evaluating generative artificial intelligence in answering course-related open questions: A pilot study.
