Published on in Vol 11 (2025)
Preprints (earlier versions) of this paper are
available at
https://preprints.jmir.org/preprint/58375, first published
.

Journals
- Wang R, Ding Y, Shen Y, Liu H, Wang P, Gao Z. Comparative Evaluation of Teaching Plans on Prostate Cancer Generated by Various Large Language Models and a Human Expert. Engineering Reports 2025;7(8) View
- Kasagga A, Sapkota A, Changaramkumarath G, Abucha J, Wollel M, Somannagari N, Husami M, Hailu K, Kasagga E. Performance of ChatGPT and Large Language Models on Medical Licensing Exams Worldwide: A Systematic Review and Network Meta-Analysis With Meta-Regression. Cureus 2025 View
- Shao M, Zhang H. Two-stage prompting framework with predefined verification steps for evaluating diagnostic reasoning tasks on two datasets. npj Digital Medicine 2025;8(1) View
- Li J, Ai F, Wang J, Cheng B, Li Y, Chen Z. Application of AI-Generated Content in Medical Education: Systematic Review of the Impact on Critical Thinking Abilities of Medical Students. JMIR Medical Education 2026;12:e79939 View
- Koç A, Ataş A, Yosunkaya Ş, Vatansev H. Performance of large language models on sleep medicine certification examination: a comprehensive multi-model analysis. Frontiers in Medicine 2026;13 View
