Accessibility settings

Published on in Vol 11 (2025)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/58375, first published .
Performance of Plug-In Augmented ChatGPT and Its Ability to Quantify Uncertainty: Simulation Study on the German Medical Board Examination

Performance of Plug-In Augmented ChatGPT and Its Ability to Quantify Uncertainty: Simulation Study on the German Medical Board Examination

Performance of Plug-In Augmented ChatGPT and Its Ability to Quantify Uncertainty: Simulation Study on the German Medical Board Examination

Journals

  1. Wang R, Ding Y, Shen Y, Liu H, Wang P, Gao Z. Comparative Evaluation of Teaching Plans on Prostate Cancer Generated by Various Large Language Models and a Human Expert. Engineering Reports 2025;7(8) View
  2. Kasagga A, Sapkota A, Changaramkumarath G, Abucha J, Wollel M, Somannagari N, Husami M, Hailu K, Kasagga E. Performance of ChatGPT and Large Language Models on Medical Licensing Exams Worldwide: A Systematic Review and Network Meta-Analysis With Meta-Regression. Cureus 2025 View
  3. Shao M, Zhang H. Two-stage prompting framework with predefined verification steps for evaluating diagnostic reasoning tasks on two datasets. npj Digital Medicine 2025;8(1) View
  4. Li J, Ai F, Wang J, Cheng B, Li Y, Chen Z. Application of AI-Generated Content in Medical Education: Systematic Review of the Impact on Critical Thinking Abilities of Medical Students. JMIR Medical Education 2026;12:e79939 View
  5. Koç A, Ataş A, Yosunkaya Ş, Vatansev H. Performance of large language models on sleep medicine certification examination: a comprehensive multi-model analysis. Frontiers in Medicine 2026;13 View