Published in Vol 9 (2023)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/50945.
The Role of Large Language Models in Medical Education: Applications and Implications

Journals

  1. Biri S, Kumar S, Panigrahi M, Mondal S, Behera J, Mondal H. Assessing the Utilization of Large Language Models in Medical Education: Insights From Undergraduate Medical Students. Cureus 2023
  2. Garcia M. ChatGPT as a Virtual Dietitian: Exploring Its Potential as a Tool for Improving Nutrition Knowledge. Applied System Innovation 2023;6(5):96
  3. van Heerden A, Bosman S, Swendeman D, Comulada W. Chatbots for HIV Prevention and Care: a Narrative Review. Current HIV/AIDS Reports 2023;20(6):481
  4. Shimizu I, Kasai H, Shikino K, Araki N, Takahashi Z, Onodera M, Kimura Y, Tsukamoto T, Yamauchi K, Asahina M, Ito S, Kawakami E. Developing Medical Education Curriculum Reform Strategies to Address the Impact of Generative AI: Qualitative Study. JMIR Medical Education 2023;9:e53466
  5. Mykhalko Y, Kish P, Rubtsova Y, Kutsyn O, Koval V. FROM TEXT TO DIAGNOSE: CHATGPT’S EFFICACY IN MEDICAL DECISION-MAKING. Wiadomości Lekarskie 2023;76(11):2345
  6. Abbas M, van Rosmalen P, Kalz M. A Data-Driven Approach for the Identification of Features for Automated Feedback on Academic Essays. IEEE Transactions on Learning Technologies 2023;16(6):914
  7. Knopp M, Warm E, Weber D, Kelleher M, Kinnear B, Schumacher D, Santen S, Mendonça E, Turner L. AI-Enabled Medical Education: Threads of Change, Promising Futures, and Risky Realities Across Four Potential Future Worlds. JMIR Medical Education 2023;9:e50373
  8. Abdaljaleel M, Barakat M, Alsanafi M, Salim N, Abazid H, Malaeb D, Mohammed A, Hassan B, Wayyes A, Farhan S, Khatib S, Rahal M, Sahban A, Abdelaziz D, Mansour N, AlZayer R, Khalil R, Fekih-Romdhane F, Hallit R, Hallit S, Sallam M. A multinational study on the factors influencing university students’ attitudes and usage of ChatGPT. Scientific Reports 2024;14(1)
  9. Benítez T, Xu Y, Boudreau J, Kow A, Bello F, Van Phuoc L, Wang X, Sun X, Leung G, Lan Y, Wang Y, Cheng D, Tham Y, Wong T, Chung K. Harnessing the potential of large language models in medical education: promise and pitfalls. Journal of the American Medical Informatics Association 2024;31(3):776
  10. Braithwaite J, Fisher G. Beyond the aspirational: creating the future of health care in Australia. Internal Medicine Journal 2024;54(2):342
  11. Thaker N, Redjal N, Loaiza-Bonilla A, Penberthy D, Showalter T, Choudhri A, Williamson S, Thaker G, Shah C, Ward M, Thaker M, Arcaro M. Large Language Models Encode Radiation Oncology Domain Knowledge: Performance on the American College of Radiology Standardized Examination. AI in Precision Oncology 2024;1(1):43
  12. Gordon M, Daniel M, Ajiboye A, Uraiby H, Xu N, Bartlett R, Hanson J, Haas M, Spadafore M, Grafton-Clarke C, Gasiea R, Michie C, Corral J, Kwan B, Dolmans D, Thammasitboon S. A scoping review of artificial intelligence in medical education: BEME Guide No. 84. Medical Teacher 2024;46(4):446
  13. Chow J, Wong V, Li K. Generative Pre-Trained Transformer-Empowered Healthcare Conversations: Current Trends, Challenges, and Future Directions in Large Language Model-Enabled Medical Chatbots. BioMedInformatics 2024;4(1):837
  14. Dahri N, Yahaya N, Al-Rahmi W, Vighio M, Alblehai F, Soomro R, Shutaleva A. Investigating AI-based academic support acceptance and its impact on students’ performance in Malaysian and Pakistani higher education institutions. Education and Information Technologies 2024;29(14):18695
  15. Gandhi A, Joesph F, Rajagopal V, Aparnavi P, Katkuri S, Dayama S, Satapathy P, Khatib M, Gaidhane S, Zahiruddin Q, Behera A. Performance of ChatGPT on the India Undergraduate Community Medicine Examination: Cross-Sectional Study. JMIR Formative Research 2024;8:e49964
  16. Cheng J. Applications of Large Language Models in Pathology. Bioengineering 2024;11(4):342
  17. Artsi Y, Sorin V, Konen E, Glicksberg B, Nadkarni G, Klang E. Large language models for generating medical examinations: systematic review. BMC Medical Education 2024;24(1)
  18. Hudon A, Kiepura B, Pelletier M, Phan V. Using ChatGPT in Psychiatry to Design Script Concordance Tests in Undergraduate Medical Education: Mixed Methods Study. JMIR Medical Education 2024;10:e54067
  19. Eltaybani S. The Transformative Role of Large Language Models in Post-Acute and Long-Term Care. Journal of the American Medical Directors Association 2024;25(6):104982
  20. Schaye V, Triola M. The generative artificial intelligence revolution: How hospitalists can lead the transformation of medical education. Journal of Hospital Medicine 2024;19(12):1181
  21. Fan K. Implications of Large Language Models in Medical Education. Atomic Article Preprint Journal 2024
  22. Martin A, DiGiovanni M, Acquaye A, Ponticiello M, Chou D, Neto E, Michel A, Sibeoni J, Piot M, Spodenkiewicz M, Benoit L. Pathways and identity: toward qualitative research careers in child and adolescent psychiatry. Child and Adolescent Psychiatry and Mental Health 2024;18(1)
  23. Rajaram A. Large language models in medical education: new tools for experimentation and discovery. Canadian Medical Education Journal 2024
  24. Li Z, Li F, Fu Q, Wang X, Liu H, Zhao Y, Ren W. Large language models and medical education: a paradigm shift in educator roles. Smart Learning Environments 2024;11(1)
  25. Kang K, Yang Y, Wu Y, Luo R. Integrating Large Language Models in Bioinformatics Education for Medical Students: Opportunities and Challenges. Annals of Biomedical Engineering 2024;52(9):2311
  26. Nilsen P, Sundemo D, Heintz F, Neher M, Nygren J, Svedberg P, Petersson L. Towards evidence-based practice 2.0: leveraging artificial intelligence in healthcare. Frontiers in Health Services 2024;4
  27. Schroeder K, Elkassabany N. Artificial intelligence and regional anesthesiology education curriculum development: navigating the digital noise. Regional Anesthesia & Pain Medicine 2024:rapm-2024-105522
  28. Bhattacharya M, Pal S, Chatterjee S, Lee S, Chakraborty C. Large language model to multimodal large language model: A journey to shape the biological macromolecules to biological sciences and medicine. Molecular Therapy - Nucleic Acids 2024;35(3):102255
  29. Lucas M, Yang J, Pomeroy J, Yang C. Reasoning with large language models for medical question answering. Journal of the American Medical Informatics Association 2024;31(9):1964
  30. Zhui L, Fenghe L, Xuehu W, Qining F, Wei R. Ethical Considerations and Fundamental Principles of Large Language Models in Medical Education: Viewpoint. Journal of Medical Internet Research 2024;26:e60083
  31. Haider S, Pressman S, Borna S, Gomez-Cabello C, Sehgal A, Leibovich B, Forte A. Evaluating Large Language Model (LLM) Performance on Established Breast Classification Systems. Diagnostics 2024;14(14):1491
  32. Mistry N, Saeed H, Rafique S, Le T, Obaid H, Adams S. Large Language Models as Tools to Generate Radiology Board-Style Multiple-Choice Questions. Academic Radiology 2024;31(9):3872
  33. Adams T, Claydon M. ChatGPT as a primary healthcare consultation training tool for combat medical technicians. BMJ Military Health 2024:e002722
  34. Sarangi P, Panda B, P. S, Pattanayak D, Panda S, Mondal H. Exploring Radiology Postgraduate Students' Engagement with Large Language Models for Educational Purposes: A Study of Knowledge, Attitudes, and Practices. Indian Journal of Radiology and Imaging 2025;35(01):035
  35. Alonso I, Oronoz M, Agerri R. MedExpQA: Multilingual benchmarking of Large Language Models for Medical Question Answering. Artificial Intelligence in Medicine 2024;155:102938
  36. Zhui L, Yhap N, Liping L, Zhengjie W, Zhonghao X, Xiaoshu Y, Hong C, Xuexiu L, Wei R. Impact of Large Language Models on Medical Education and Teaching Adaptations. JMIR Medical Informatics 2024;12:e55933
  37. Suresh S, Misra S. Large Language Models in Pediatric Education: Current Uses and Future Potential. Pediatrics 2024;154(3)
  38. Guo E, Ramchandani R, Park Y, Gupta M. OSCEai: personalized interactive learning for undergraduate medical education. Canadian Medical Education Journal 2024
  39. Lonsdale H, O’Reilly-Shah V, Padiyath A, Simpao A. Supercharge Your Academic Productivity with Generative Artificial Intelligence. Journal of Medical Systems 2024;48(1)
  40. Tong W, Zhang X, Zeng H, Pan J, Gong C, Zhang H. Reforming China’s Secondary Vocational Medical Education: Adapting to the Challenges and Opportunities of the AI Era. JMIR Medical Education 2024;10:e48594
  41. Gan W, Ouyang J, Li H, Xue Z, Zhang Y, Dong Q, Huang J, Zheng X, Zhang Y. Integrating ChatGPT in Orthopedic Education for Medical Undergraduates: Randomized Controlled Trial. Journal of Medical Internet Research 2024;26:e57037
  42. On Y, Kim T, Kim N. Psychotherapy Based on the Large Language Models: On the Aspect of the Theory of Mind, a Narrative Review. Journal of Korean Neuropsychiatric Association 2024;63(3):151
  43. Wang J, Shi R, Le Q, Shan K, Chen Z, Zhou X, He Y, Hong J. Evaluating the effectiveness of large language models in patient education for conjunctivitis. British Journal of Ophthalmology 2025;109(2):185
  44. Fraga-Sastrías J, Navarrini H, Silva-Brehuer M, Espejo-González R, Olvera-Cortés H, Rubio-Martínez R. Uso de Chat-GPT para la generación y conducción de escenarios simulados para el aprendizaje de habilidades no técnicas. Revista Latinoamericana de Simulación Clínica 2024;6(2):64
  45. Seth I, Lim B, Phan R, Xie Y, Kenney P, Bukret W, Thomsen J, Cuomo R, Ross R, Ng S, Rozen W. Perforator Selection with Computed Tomography Angiography for Unilateral Breast Reconstruction: A Clinical Multicentre Analysis. Medicina 2024;60(9):1500
  46. Bettoli V, Naldi L, Santoro E, Valetto M, Bolzon A, Cassalia F, Cazzaniga S, Cima S, Danese A, Emendi S, Ponzano M, Scarpa N, Dri P. ChatGPT and acne: Accuracy and reliability of the information provided—The AI‐check study. Journal of the European Academy of Dermatology and Venereology 2025;39(4)
  47. McQuade C, Wijesekera T, Chartash D. Dispelling the magic of artificial intelligence in medical education. Medical Education 2025;59(3):350
  48. Su Z, Tang G, Huang R, Qiao Y, Zhang Z, Dai X. Based on Medicine, The Now and Future of Large Language Models. Cellular and Molecular Bioengineering 2024;17(4):263
  49. Fan K, Fan K. Dermatological Knowledge and Image Analysis Performance of Large Language Models Based on Specialty Certificate Examination in Dermatology. Dermato 2024;4(4):124
  50. Morreale M, Balon R, Beresin E, Seritan A, Castillo E, Thomas L, Louie A, Aggarwal R, Guerrero A, Coverdale J, Brenner A. Artificial Intelligence and Medical Education, Academic Writing, and Journal Policies: A Focus on Large Language Models. Academic Psychiatry 2025;49(1):5
  51. Sallam M, Al-Mahzoum K, Almutairi Y, Alaqeel O, Abu Salami A, Almutairi Z, Alsarraf A, Barakat M. Anxiety among Medical Students Regarding Generative Artificial Intelligence Models: A Pilot Descriptive Study. International Medical Education 2024;3(4):406
  52. Ramgopal S, Varma S, Gorski J, Kester K, Shieh A, Suresh S. Evaluation of a Large Language Model on the American Academy of Pediatrics' PREP Emergency Medicine Question Bank. Pediatric Emergency Care 2024;40(12):871
  53. Yeo Y, Peng Y, Mehra M, Samaan J, Hakimian J, Clark A, Suchak K, Krut Z, Andersson T, Persky S, Liran O, Spiegel B. Evaluating for Evidence of Sociodemographic Bias in Conversational AI for Mental Health Support. Cyberpsychology, Behavior, and Social Networking 2025;28(1):44
  54. Zheng J, Ding X, Pu J, Chung S, Ai Q, Hung K, Shan Z. Unlocking the Potentials of Large Language Models in Orthodontics: A Scoping Review. Bioengineering 2024;11(11):1145
  55. Deb Roy A, Bharat Jaiswal I, Nath Tiu D, Das D, Mondal S, Behera J, Mondal H. Assessing the Utilization of Large Language Model Chatbots for Educational Purposes by Medical Teachers: A Nationwide Survey From India. Cureus 2024
  56. Aster A, Laupichler M, Rockwell-Kollmann T, Masala G, Bala E, Raupach T. ChatGPT and Other Large Language Models in Medical Education — Scoping Literature Review. Medical Science Educator 2024;35(1):555
  57. Naldi L, Bettoli V, Santoro E, Valetto M, Bolzon A, Cassalia F, Cazzaniga S, Cima S, Danese A, Emendi S, Ponzano M, Scarpa N, Dri P. Application of ChatGPT as a content generation tool in continuing medical education: acne as a test topic. Dermatology Reports 2024
  58. Anisuzzaman D, Malins J, Friedman P, Attia Z. Fine-Tuning Large Language Models for Specialized Use Cases. Mayo Clinic Proceedings: Digital Health 2025;3(1):100184
  59. Zhang X, Wu C, Zhao Z, Lin W, Zhang Y, Wang Y, Xie W. Development of a large-scale medical visual question-answering dataset. Communications Medicine 2024;4(1)
  60. Maaß L, Grab-Kroll C, Koerner J, Öchsner W, Schön M, Messerer D, Böckers T, Böckers A. Artificial Intelligence and ChatGPT in Medical Education: A Cross-Sectional Questionnaire on students’ Competence. Journal of CME 2025;14(1)
  61. Gupta N, Khatri K, Malik Y, Lakhani A, Kanwal A, Aggarwal S, Dahuja A. Exploring prospects, hurdles, and road ahead for generative artificial intelligence in orthopedic education and training. BMC Medical Education 2024;24(1)
  62. Kaewboonlert N, Poontananggul J, Pongsuwan N, Bhakdisongkhram G. Factors Associated With the Accuracy of Large Language Models in Basic Medical Science Examinations: Cross-Sectional Study. JMIR Medical Education 2025;11:e58898
  63. Kondo T, Okamoto M, Kondo Y. Pilot Study on Using Large Language Models for Educational Resource Development in Japanese Radiological Technologist Exams. Medical Science Educator 2025
  64. Adarkwah M, Badu S, Osei E, Adu-Gyamfi E, Odame J, Schneider K. ChatGPT in healthcare education: a double-edged sword of trends, challenges, and opportunities. Discover Education 2025;4(1)
  65. Mess S, Mackey A, Yarowsky D. Artificial Intelligence Scribe and Large Language Model Technology in Healthcare Documentation: Advantages, Limitations, and Recommendations. Plastic and Reconstructive Surgery - Global Open 2025;13(1):e6450
  66. Malik S, Frey L, Gutman J, Mushtaq A, Warraich F, Qureshi K. Evaluating Artificial Intelligence-Driven Responses to Acute Liver Failure Queries: A Comparative Analysis Across Accuracy, Clarity, and Relevance. American Journal of Gastroenterology 2024
  67. Maaz S, Palaganas J, Palaganas G, Bajwa M. A guide to prompt design: foundations and applications for healthcare simulationists. Frontiers in Medicine 2025;11
  68. Feigerlova E, Hani H, Hothersall-Davies E. A systematic review of the impact of artificial intelligence on educational outcomes in health professions education. BMC Medical Education 2025;25(1)
  69. Cera R. Intelligenza generativa artificiale in medical education: ragionamento clinico artificiale vs ragionamento clinico umano. EDUCATION SCIENCES AND SOCIETY 2025;(2):239
  70. Waldock W, Lam G, Baptista A, Walls R, Sam A. Which curriculum components do medical students find most helpful for evaluating AI outputs? BMC Medical Education 2025;25(1)
  71. Sohrabniya F, Hassanzadeh-Samani S, Ourang S, Jafari B, Farzinnia G, Gorjinejad F, Ghalyanchi-Langeroudi A, Mohammad-Rahimi H, Tichy A, Motamedian S, Schwendicke F. Exploring a decade of deep learning in dentistry: A comprehensive mapping review. Clinical Oral Investigations 2025;29(2)
  72. Carl N, Haggenmüller S, Wies C, Nguyen L, Winterstein J, Hetz M, Mangold M, Hartung F, Grüne B, Holland‐Letz T, Michel M, Brinker T, Wessels F. Evaluating interactions of patients with large language models for medical information. BJU International 2025
  73. Sakelaris P, Novotny K, Borvick M, Lagasca G, Simanton E. Evaluating the Use of Artificial Intelligence as a Study Tool for Preclinical Medical School Exams. Journal of Medical Education and Curricular Development 2025;12
  74. Dennstädt F, Hastings J, Putora P, Schmerder M, Cihoric N. Implementing large language models in healthcare while balancing control, collaboration, costs and security. npj Digital Medicine 2025;8(1)
  75. Shahid F, Hsu M, Chang Y, Jian W. Using Generative AI to Extract Structured Information from Free Text Pathology Reports. Journal of Medical Systems 2025;49(1)
  76. Cross J, Kayalackakom T, Robinson R, Vaughans A, Sebastian R, Hood R, Lewis C, Devaraju S, Honnavar P, Naik S, Joseph J, Anand N, Mohammed A, Johnson A, Cohen E, Adeniji T, Nnenna Nnaji A, George J. The Digital Shift: Assessing ChatGPT’s Capability as a New Age Standardized Patient – A Qualitative Study (Preprint). JMIR Medical Education 2024
  77. Santos M, Lima F, Teixeira R, Ribeiro F, Silveira A, Campos Filho A. Chatbot para Assistência Remota em Hipertensão Arterial Sistêmica: Revisão da Literatura. Revista Brasileira de Informática na Educação 2025;33:34
  78. Singh S, Chaurasia A, Raichandani S, Grewal H, Udare A, Jawahar A. Commentary: Leveraging Large Language Models for Radiology Education and Training. Journal of Computer Assisted Tomography 2025
  79. Patil N, Kou N, Baptista‐Hon D, Monteiro O. Artificial Intelligence in Medical Education: A Practical Guide for Educators. MedComm – Future Medicine 2025;4(2)
  80. Eoh K. Prospects and applications of artificial intelligence and large language models in obstetrics and gynecology education: a narrative review. Journal of the Korean Medical Association 2025;68(3):161
  81. Güvel M, Kıyak Y, Varan H, Sezenöz B, Coşkun Ö, Uluoğlu C. Generative AI vs. human expertise: a comparative analysis of case-based rational pharmacotherapy question generation. European Journal of Clinical Pharmacology 2025
  82. Lafourcade C, Kérourédan O, Ballester B, Richert R. Accuracy, consistency, and contextual understanding of large language models in restorative dentistry and endodontics. Journal of Dentistry 2025;157:105764
  83. Hoch C, Funk P, Guntinas-Lichius O, Volk G, Lüers J, Hussain T, Wirth M, Schmidl B, Wollenberg B, Alfertshofer M. Harnessing advanced large language models in otolaryngology board examinations: an investigation using python and application programming interfaces. European Archives of Oto-Rhino-Laryngology 2025
  84. Sumner J, Wang Y, Tan S, Chew E, Wenjun Yip A. Perspectives and Experiences With Large Language Models in Health Care: Survey Study. Journal of Medical Internet Research 2025;27:e67383

Books/Policy Documents

  1. Pears M, Konstantinidis S. Disruptive Technologies in Education and Workforce Development.
  2. Nawaz F, Opriessnig E, Usman F, Agrohi J, Arshad Z, Kashyap R, Anwar S. Precision Health in the Digital Age.
  3. Xu M, Wang Y. Human Brain and Artificial Intelligence.

Conference Proceedings

  1. Murali R, Ravi N, Surendran A. 2024 IEEE Global Engineering Education Conference (EDUCON). Augmenting Virtual Labs with Artificial Intelligence for Hybrid Learning
  2. Moore S, Schmucker R, Mitchell T, Stamper J. Proceedings of the Eleventh ACM Conference on Learning @ Scale. Automated Generation and Tagging of Knowledge Components from Multiple-Choice Questions
  3. Nakamoto S, Okamoto Y, Nakakouchi T, Shimada K. 2024 16th IIAI International Congress on Advanced Applied Informatics (IIAI-AAI). Towards Human-Level Evaluation: Assessing the Potential of GPT-4 in Automated Evaluation and Feedback Generation on Japanese Essays
  4. Shankar A, T R M, Kumar S, Anurag A, Narayan A, P J. 2024 2nd International Conference on Networking, Embedded and Wireless Systems (ICNEWS). Advancements in AI-Driven Dentistry: Tooth GenAI’s Impact on Dental Diagnosis and Treatment Planning
  5. Bin Yousuf R, Defelice N, Sharma M, Xu S, Ramakrishnan N. 2024 IEEE International Conference on Big Data (BigData). LLM Augmentations to support Analytical Reasoning over Multiple Documents
  6. Fu Z, Lee C. Proceedings of the 2024 8th International Conference on Natural Language Processing and Information Retrieval. Leveraging Large Language Models for Automated Knowledge Acquisition in Personal Health Status Evaluation
  7. Wang A, Ruparel R, Iurchenko A, Jhun P, Séguin J, Strachan P, Wong R, Karthikesalingam A, Matias Y, Hassidim A, Webster D, Semturs C, Krause J, Schaekermann M. Proceedings of the Extended Abstracts of the CHI Conference on Human Factors in Computing Systems. Generative AI for medical education: Insights from a case study with medical students and an AI tutor for clinical reasoning
  8. Ouyang Y, Xu Y, Jiang C, Li Q. Proceedings of the Extended Abstracts of the CHI Conference on Human Factors in Computing Systems. CaseMaster: Designing a Probe for Oral Case Presentation Training with LLM Assistance
  9. Khot R, Arets T, Wester J, Burger F, van Berkel N, Brankaert R, IJsselsteijn W, Lee M. Proceedings of the 2025 CHI Conference on Human Factors in Computing Systems. Challenging Futures: Using Chatbots to Reflect on Aging and Dementia