JMIR Medical Education
https://mededu.jmir.org/issue/feed
2023-01-06T11:15:24-05:00
JMIR Publications (editor@jmir.org) | Open Journal Systems
Technology, innovation, and openness in medical education in the information age.
This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Education, is properly cited. The complete bibliographic information, a link to the original publication on https://mededu.jmir.org/, as well as this copyright and license information must be included.

https://mededu.jmir.org/2024/1/e54393/
Capability of GPT-4V(ision) in the Japanese National Medical Licensing Examination: Evaluation Study
2024-03-12T09:45:04-04:00
Takahiro Nakao, Soichiro Miki, Yuta Nakamura, Tomohiro Kikuchi, Yukihiro Nomura, Shouhei Hanaoka, Takeharu Yoshikawa, Osamu Abe
<strong>Background:</strong> Previous research applying large language models (LLMs) to medicine focused on text-based information. Recently, multimodal variants of LLMs have acquired the capability to recognize images. <strong>Objective:</strong> We aimed to evaluate the image recognition capability of generative pretrained transformer (GPT)-4V, a recent multimodal LLM developed by OpenAI, in the medical field by testing how visual information affects its performance in answering questions from the 117th Japanese National Medical Licensing Examination. <strong>Methods:</strong> We focused on 108 questions that had 1 or more images as part of the question and presented GPT-4V with the same questions under 2 conditions: (1) with both the question text and the associated images and (2) with the question text only. We then compared the difference in accuracy between the 2 conditions using the exact McNemar test. <strong>Results:</strong> Among the 108 questions with images, GPT-4V’s accuracy was 68% (73/108) when presented with images and 72% (78/108) when presented without images (<i>P</i>=.36). For the 2 question categories, clinical and general, the accuracies with and without images were 71% (70/98) versus 78% (76/98; <i>P</i>=.21) and 30% (3/10) versus 20% (2/10; <i>P</i>≥.99), respectively. <strong>Conclusions:</strong> The additional information from the images did not significantly improve the performance of GPT-4V in the Japanese National Medical Licensing Examination.

https://mededu.jmir.org/2024/1/e48393/
Sharing Digital Health Educational Resources in a One-Stop Shop Portal: Tutorial on the Catalog and Index of Digital Health Teaching Resources (CIDHR) Semantic Search Engine
2024-03-04T10:00:04-05:00
Julien Grosjean, Arriel Benis, Jean-Charles Dufour, Émeline Lejeune, Flavien Disson, Badisse Dahamna, Hélène Cieslik, Romain Léguillon, Matthieu Faure, Frank Dufour, Pascal Staccini, Stéfan Jacques Darmoni
<strong>Background:</strong> Access to reliable and accurate digital health web-based resources is crucial. However, the lack of dedicated search engines for non-English languages, such as French, is a significant obstacle in this field. Thus, we developed and implemented a multilingual, multiterminology semantic search engine called <i>Catalog and Index of Digital Health Teaching Resources</i> (CIDHR). CIDHR is freely accessible to everyone, with a focus on French-speaking resources.
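As a concrete companion to the GPT-4V study summarized above, here is a minimal sketch of the paired analysis it describes: per-question correctness under the 2 conditions, compared with an exact McNemar test. The outcome vectors and the use of statsmodels are illustrative assumptions, not the authors' code.

```python
# Minimal sketch of the exact McNemar comparison described in the GPT-4V
# study above. The per-question outcomes below are invented for illustration;
# statsmodels is assumed to be available.
from statsmodels.stats.contingency_tables import mcnemar

# Hypothetical per-question correctness, aligned so that index i refers to
# the same question under both conditions.
with_images = [True, False, True, True, False, True]
without_images = [True, True, True, False, False, True]

# Build the 2x2 table of paired outcomes; McNemar's test uses the
# off-diagonal (discordant) cells.
both = sum(a and b for a, b in zip(with_images, without_images))
only_with = sum(a and not b for a, b in zip(with_images, without_images))
only_without = sum(b and not a for a, b in zip(with_images, without_images))
neither = sum(not (a or b) for a, b in zip(with_images, without_images))
table = [[both, only_with], [only_without, neither]]

# exact=True requests the binomial (exact) version of the test, appropriate
# when the discordant-pair counts are small.
result = mcnemar(table, exact=True)
print(f"statistic={result.statistic}, P={result.pvalue:.2f}")
```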
CIDHR has been initiated to provide validated, high-quality content tailored to the specific needs of each user profile, whether students or professionals. <strong>Objective:</strong> The primary aim of this study in developing and implementing CIDHR is to improve knowledge sharing and dissemination in digital health and health informatics and to expand the health-related educational community, primarily French speaking but also in other languages. We intend to support the continuous development of initial (ie, bachelor level), advanced (ie, master and doctoral levels), and continuing training (ie, professional and postgraduate levels) in digital health for the health and social work fields. The main objective is to describe the development and implementation of CIDHR. The hypothesis guiding this research is that using controlled vocabularies dedicated to medical informatics and digital health, such as the Medical Informatics Multilingual Ontology (MIMO) and the concepts structuring the French National Referential on Digital Health (FNRDH), to index digital health teaching and learning resources effectively increases the availability and accessibility of these resources to medical students and other health care professionals. <strong>Methods:</strong> First, resources are identified by medical librarians from websites and scientific sources that are preselected and validated by domain experts and surveyed every week. Then, based on MIMO and FNRDH, the educational resources are indexed for each related knowledge domain. The same resources are also tagged with the relevant academic and professional experience levels. Afterward, the indexed resources are shared with the digital health teaching and learning community. The last step consists of assessing CIDHR by obtaining informal feedback from users. <strong>Results:</strong> Resource identification and evaluation processes were executed by a dedicated team of medical librarians, aiming to collect and curate an extensive collection of digital health teaching and learning resources. The resources that successfully passed the evaluation process were promptly included in CIDHR. These resources were diligently indexed (with MIMO and FNRDH) and tagged by study field and degree level. By October 2023, a total of 371 indexed resources were available on a dedicated portal. <strong>Conclusions:</strong> CIDHR is a multilingual digital health education semantic search engine and platform that aims to increase the accessibility of educational resources to the broader health care–related community. It focuses on making resources “findable,” “accessible,” “interoperable,” and “reusable” by using a one-stop shop portal approach. CIDHR has, and will continue to have, an essential role in increasing digital health literacy.

https://mededu.jmir.org/2024/1/e54401/
Development of a Clinical Simulation Video to Evaluate Multiple Domains of Clinical Competence: Cross-Sectional Study
2024-02-29T09:45:24-05:00
Kiyoshi Shikino, Yuji Nishizaki, Sho Fukui, Daiki Yokokawa, Yu Yamamoto, Hiroyuki Kobayashi, Taro Shimizu, Yasuharu Tokuda
<strong>Background:</strong> Medical school graduates in Japan undergo a 2-year postgraduate residency program to acquire clinical knowledge and general medical skills. The General Medicine In-Training Examination (GM-ITE) assesses postgraduate residents’ clinical knowledge. A clinical simulation video (CSV) may assess learners’ interpersonal abilities.
<strong>Objective:</strong> This study aimed to evaluate the relationship between GM-ITE scores and resident physicians’ diagnostic skills by having them watch a CSV and to explore resident physicians’ perceptions of the CSV’s realism, educational value, and impact on their motivation to learn. <strong>Methods:</strong> The participants included 56 postgraduate medical residents who took the GM-ITE between January 21 and January 28, 2021; watched the CSV; and then provided a diagnosis. The CSV and GM-ITE scores were compared, and the validity of the simulations was examined using discrimination indices, wherein ≥0.20 indicated high discriminatory power and >0.40 indicated a very good measure of the subject’s qualifications. Additionally, we administered an anonymous questionnaire to ascertain participants’ views on the realism and educational value of the CSV and its impact on their motivation to learn. <strong>Results:</strong> Of the 56 participants, 6 (11%) provided the correct diagnosis, and all were from the second postgraduate year. All domains indicated high discriminatory power. The anonymous survey revealed that 12 (52%) participants found the CSV format more suitable than the conventional GM-ITE for assessing clinical competence, 18 (78%) affirmed the realism of the video simulation, and 17 (74%) indicated that the experience increased their motivation to learn. <strong>Conclusions:</strong> The findings indicated that CSV modules simulating real-world clinical examinations were successful in assessing examinees’ clinical competence across multiple domains. The study demonstrated that the CSV not only augmented the assessment of diagnostic skills but also positively impacted learners’ motivation, suggesting a multifaceted role for simulation in medical education.

https://mededu.jmir.org/2024/1/e51426/
Exploring the Feasibility of Using ChatGPT to Create Just-in-Time Adaptive Physical Activity mHealth Intervention Content: Case Study
2024-02-29T09:30:28-05:00
Amanda Willms, Sam Liu
<strong>Background:</strong> Achieving physical activity (PA) guidelines’ recommendation of 150 minutes of moderate-to-vigorous PA per week has been shown to reduce the risk of many chronic conditions. Despite the overwhelming evidence in this field, PA levels remain low globally. By creating engaging mobile health (mHealth) interventions through strategies such as just-in-time adaptive interventions (JITAIs), which are tailored to an individual’s dynamic state, there is potential to increase PA levels. However, generating personalized content can take a long time because of the many versions of content required by the personalization algorithms. ChatGPT presents a promising opportunity to rapidly produce tailored content; however, there is a lack of studies exploring its feasibility. <strong>Objective:</strong> This study aimed to (1) explore the feasibility of using ChatGPT to create content for a PA JITAI mobile app and (2) describe lessons learned and future recommendations for using ChatGPT in the development of mHealth JITAI content. <strong>Methods:</strong> During phase 1, we used Pathverse, a no-code app builder, and ChatGPT to develop a JITAI app to help parents support their child’s PA levels.
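The content-generation step in this study was performed interactively in ChatGPT; purely as a rough illustration, an equivalent programmatic call through the OpenAI Python client might look like the sketch below. The model name, prompt wording, and lesson topic are invented assumptions, and any output would still need expert review before use.

```python
# Hypothetical sketch of generating one JITAI lesson draft with the OpenAI
# Python client. The study used ChatGPT interactively; this programmatic
# variant, the model name, and the prompt are illustrative assumptions.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

prompt = (
    "Write a 150-word lesson for parents of children aged 8-12 on how to "
    "support their child's physical activity this week. Use an encouraging, "
    "plain-language tone and end with one concrete action item."
)

response = client.chat.completions.create(
    model="gpt-4",  # assumed; any chat-capable model could be substituted
    messages=[{"role": "user", "content": prompt}],
)

# The raw draft still needs expert revision and customization before it
# reaches an intervention participant.
print(response.choices[0].message.content)
```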
The intervention was developed based on the Multi-Process Action Control (M-PAC) framework, and the necessary behavior change techniques targeting the M-PAC constructs were implemented in the app design to help parents support their child’s PA. The acceptability of using ChatGPT for this purpose was then discussed to determine its feasibility. In phase 2, we summarized the lessons learned during the JITAI content development process with ChatGPT and generated recommendations to inform future similar use cases. <strong>Results:</strong> In phase 1, by using specific prompts, we efficiently generated content for 13 lessons on increasing parental support for children’s PA, following the M-PAC framework. Using ChatGPT to develop PA content for a JITAI was determined to be acceptable in this case study. In phase 2, we summarized our recommendations into the following 6 steps for using ChatGPT to create content for mHealth behavior interventions: (1) determine the target behavior, (2) ground the intervention in behavior change theory, (3) design the intervention structure, (4) input the intervention structure and behavior change constructs into ChatGPT, (5) revise the ChatGPT response, and (6) customize the response to be used in the intervention. <strong>Conclusions:</strong> ChatGPT offers a remarkable opportunity for rapid content creation in the context of an mHealth JITAI. Although our case study demonstrated that ChatGPT was acceptable, it is essential to approach the use of ChatGPT, and of other language models, with caution. Before delivering content to population groups, expert review is crucial to ensure accuracy and relevance. Future research and application of these guidelines are imperative as we deepen our understanding of ChatGPT and its interactions with human input.

https://mededu.jmir.org/2024/1/e57594/
Correction: How Does ChatGPT Perform on the United States Medical Licensing Examination (USMLE)? The Implications of Large Language Models for Medical Education and Knowledge Assessment
2024-02-27T13:00:04-05:00
Aidan Gilson, Conrad W Safranek, Thomas Huang, Vimig Socrates, Ling Chi, Richard Andrew Taylor, David Chartash

https://mededu.jmir.org/2024/1/e50156/
Measuring e-Professional Behavior of Doctors of Medicine and Dental Medicine on Social Networking Sites: Indexes Construction With Formative Indicators
2024-02-27T09:45:21-05:00
Marko Marelić, Ksenija Klasnić, Tea Vukušić Rukavina
<strong>Background:</strong> Previous studies have predominantly measured e-professionalism through perceptions or attitudes, yet no validated measure specifically targets the actual behaviors of health care professionals (HCPs) in this realm. This study addresses this gap by constructing a normative framework, drawing from 3 primary sources to define e-professional behavior across 6 domains. Four domains pertain to the dangers of social networking sites (SNSs), encompassing confidentiality, privacy, patient interaction, and equitable resource allocation. Meanwhile, 2 domains focus on the opportunities of SNSs, namely, the proactive dissemination of public health information and the maintenance of scientific integrity. <strong>Objective:</strong> This study aims to develop and validate 2 new measures assessing the e-professional behavior of doctors of medicine (MDs) and doctors of dental medicine (DMDs), focusing on both the dangers and the opportunities associated with SNSs.
<strong>Methods:</strong> The study used a purposive sample of MDs and DMDs in Croatia who were users of at least 1 SNS. Data collection took place in 2021 through an online survey. Validation of both indexes used a formative approach involving a 5-step methodology: content specification; definition of the indicators, with instructions for item coding and index construction; a collinearity check of the indicators using the variance inflation factor (VIF); an external validity test using a multiple indicators multiple causes (MIMIC) model; and an external validity test checking the relationships of the indexes with a scale of attitudes toward SNSs using Pearson correlation coefficients. <strong>Results:</strong> A total of 753 responses were included in the analysis. The first e-professionalism index, assessing the dangers associated with SNSs, comprises 14 items. During the collinearity check, all indicators displayed acceptable VIF values below 2.5. The MIMIC model showed good fit (χ<sup>2</sup><sub>13</sub>=9.4, <i>P</i>=.742; χ<sup>2</sup>/df=0.723; root-mean-square error of approximation <.001; goodness-of-fit index=0.998; comparative fit index=1.000). The external validity of the index is supported by a statistically significant negative correlation with the scale measuring attitudes toward SNSs (r=–0.225; <i>P</i><.001). Following the removal of 1 item, the second e-professionalism index, focusing on the opportunities associated with SNSs, comprises 5 items. During the collinearity check, all indicators exhibited acceptable VIF values below 2.5. Additionally, the MIMIC model demonstrated a good fit (χ<sup>2</sup><sub>4</sub>=2.5, <i>P</i>=.718; χ<sup>2</sup>/df=0.637; root-mean-square error of approximation <.001; goodness-of-fit index=0.999; comparative fit index=1.000). The external validity of the index is supported by a statistically significant positive correlation with the scale of attitudes toward SNSs (r=0.338; <i>P</i><.001). <strong>Conclusions:</strong> Following the validation process, the instrument designed for gauging the e-professional behavior of MDs and DMDs consists of 19 items, which form 2 distinct indexes: a 14-item e-professionalism index focusing on the dangers associated with SNSs and a 5-item e-professionalism index highlighting the opportunities offered by SNSs. These indexes serve as valid measures of the e-professional behavior of MDs and DMDs, with the potential for further refinement to encompass emerging forms of unprofessional behavior that may arise over time.

https://mededu.jmir.org/2024/1/e48989/
Using ChatGPT-Like Solutions to Bridge the Communication Gap Between Patients With Rheumatoid Arthritis and Health Care Professionals
2024-02-27T09:45:04-05:00
Chih-Wei Chen, Paul Walter, James Cheng-Chung Wei
The communication gap between patients and health care professionals has led to increased disputes and resource waste in the medical domain. The development of artificial intelligence and other technologies brings new possibilities for solving this problem. This viewpoint paper proposes a new relationship between patients and health care professionals, “shared decision-making,” which allows both sides to obtain a deeper understanding of the disease and reach a consensus during diagnosis and treatment. The paper then discusses the important impact of ChatGPT-like solutions on the treatment of rheumatoid arthritis with methotrexate, from both clinical and patient perspectives.
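As an aside on the methods of the index-validation study above, the collinearity check it reports (all VIFs below 2.5) can be illustrated with a short sketch. The simulated indicator data and the use of numpy and statsmodels are assumptions for illustration; the study's actual analysis also involved MIMIC models and correlation tests not shown here.

```python
# Illustrative variance inflation factor (VIF) check of the kind used to
# validate the e-professionalism indexes above. The 14 binary indicators for
# n=753 respondents are simulated; numpy and statsmodels are assumed.
import numpy as np
from statsmodels.stats.outliers_influence import variance_inflation_factor
from statsmodels.tools import add_constant

rng = np.random.default_rng(0)
indicators = rng.integers(0, 2, size=(753, 14)).astype(float)

# VIF is computed for each indicator against a design matrix that includes
# an intercept; column 0 is the constant, so the loop skips it.
X = add_constant(indicators)
for i in range(1, X.shape[1]):
    vif = variance_inflation_factor(X, i)
    flag = "  <-- above 2.5, collinearity concern" if vif > 2.5 else ""
    print(f"indicator {i:2d}: VIF = {vif:.2f}{flag}")
```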
For clinical professionals, ChatGPT-like solutions could provide support in disease diagnosis, treatment, and clinical trials, but attention should be paid to privacy, confidentiality, and regulatory norms. For patients, ChatGPT-like solutions allow easy access to massive amounts of information; however, that information should be carefully managed to ensure safe and effective care. To ensure the effective application of ChatGPT-like solutions in improving the relationship between patients and health care professionals, it is essential to establish a comprehensive database and to provide legal, ethical, and other support. Above all, ChatGPT-like solutions could benefit patients and health care professionals if they provide evidence-based solutions, ensure data protection, and evolve in collaboration with regulatory authorities.

https://mededu.jmir.org/2024/1/e52155/
Using AI Text-to-Image Generation to Create Novel Illustrations for Medical Education: Current Limitations as Illustrated by Hypothyroidism and Horner Syndrome
2024-02-22T10:00:28-05:00
Ajay Kumar, Pierce Burr, Tim Michael Young
Our research letter investigates the potential, as well as the current limitations, of widely available text-to-image tools in generating images for medical education. We focused on illustrations of important physical signs in the face that medics should know about, and for which confidentiality issues may be a particular concern when conventional patient photographs are used; we used facial images of hypothyroidism and Horner syndrome as examples.

https://mededu.jmir.org/2024/1/e51523/
Evaluating Large Language Models for the National Premedical Exam in India: Comparative Analysis of GPT-3.5, GPT-4, and Bard
2024-02-21T10:00:41-05:00
Faiza Farhat, Beenish Moalla Chaudhry, Mohammad Nadeem, Shahab Saquib Sohail, Dag Øivind Madsen
<strong>Background:</strong> Large language models (LLMs) have revolutionized natural language processing with their ability to generate human-like text through extensive training on large data sets. These models, including Generative Pre-trained Transformers (GPT)-3.5 (OpenAI), GPT-4 (OpenAI), and Bard (Google LLC), find applications beyond natural language processing, attracting interest from academia and industry. Students are actively leveraging LLMs to enhance learning experiences and to prepare for high-stakes exams, such as the National Eligibility cum Entrance Test (NEET) in India. <strong>Objective:</strong> This comparative analysis aims to evaluate the performance of GPT-3.5, GPT-4, and Bard in answering NEET-2023 questions. <strong>Methods:</strong> In this paper, we evaluated the performance of the 3 mainstream LLMs, namely GPT-3.5, GPT-4, and Google Bard, in answering questions related to the NEET-2023 exam. The questions of the NEET were provided to these artificial intelligence models, and the responses were recorded and compared against the correct answers from the official answer key. Consensus was used to evaluate the performance of all 3 models. <strong>Results:</strong> GPT-4 passed the entrance test with flying colors (300/700, 42.9%), showcasing exceptional performance. GPT-3.5 also managed to meet the qualifying criteria, but with a substantially lower score (145/700, 20.7%). Bard (115/700, 16.4%), however, failed to meet the qualifying criteria and did not pass the test. GPT-4 demonstrated consistent superiority over Bard and GPT-3.5 in all 3 subjects.
Specifically, GPT-4 achieved accuracy rates of 73% (29/40) in physics, 44% (16/36) in chemistry, and 51% (50/99) in biology. Conversely, GPT-3.5 attained accuracy rates of 45% (18/40) in physics, 33% (13/26) in chemistry, and 34% (34/99) in biology. The accuracy consensus metric showed that the matching responses between GPT-4 and Bard, as well as between GPT-4 and GPT-3.5, were more often correct, at 0.56 and 0.57, respectively, than the matching responses between Bard and GPT-3.5, which stood at 0.42. When all 3 models were considered together, their matching responses reached the highest accuracy consensus of 0.59. <strong>Conclusions:</strong> The study’s findings provide valuable insights into the performance of GPT-3.5, GPT-4, and Bard in answering NEET-2023 questions. GPT-4 emerged as the most accurate model, highlighting its potential for educational applications. Cross-checking responses across models may result in confusion, as the compared models (as pairs or as a trio) tend to agree on only a little over half of the correct responses. Using GPT-4 as one of the compared models will result in a higher accuracy consensus. The results underscore the suitability of LLMs for high-stakes exams and their positive impact on education. Additionally, the study establishes a benchmark for evaluating and enhancing LLMs’ performance in educational tasks, promoting responsible and informed use of these models in diverse learning environments.

https://mededu.jmir.org/2024/1/e48507/
Occupational Therapy Students’ Evidence-Based Practice Skills as Reported in a Mobile App: Cross-Sectional Study
2024-02-21T10:00:25-05:00
Susanne G Johnson, Birgitte Espehaug, Lillebeth Larun, Donna Ciliska, Nina Rydland Olsen
<strong>Background:</strong> Evidence-based practice (EBP) is an important aspect of the health care education curriculum. EBP involves following the 5 EBP steps: ask, assess, appraise, apply, and audit. These 5 steps reflect the suggested core competencies covered in teaching and learning programs to support future health care professionals in applying EBP. When implementing EBP teaching, it is relevant to assess outcomes by documenting students’ performance and skills, which can be done using mobile devices. <strong>Objective:</strong> The aim of this study was to assess occupational therapy students’ EBP skills as reported in a mobile app. <strong>Methods:</strong> We applied a cross-sectional design. Descriptive statistics were used to present frequencies, percentages, means, and ranges of the data on EBP skills found in the EBPsteps app. Associations between students’ ability to formulate the Population, Intervention, Comparison, and Outcome/Population, Interest, and Context (PICO/PICo) elements and their ability to identify relevant research evidence were analyzed with the chi-square test. <strong>Results:</strong> Of 4 cohorts with 150 students, 119 (79.3%) students used the app and produced 240 critically appraised topics (CATs) in the app. The EBP steps “ask,” “assess,” and “appraise” were often correctly performed. The clinical question was formulated correctly in 53.3% (128/240) of the CATs, and students identified research evidence in 81.2% (195/240) of the CATs. Critical appraisal checklists were used in 81.2% (195/240) of the CATs, and most of these checklists were assessed as relevant for the type of research evidence identified (165/195, 84.6%).
The steps least frequently reported correctly were “apply” and “audit.” In 39.6% (95/240) of the CATs, it was reported that research evidence had been applied, but only 61% (58/95) of these CATs described how the research was applied to clinical practice. Evaluation of practice changes was reported in 38.8% (93/240) of the CATs; however, details about the practice changes were lacking in all of these CATs. A positive association was found between correctly reporting the “population” and “interventions/interest” elements of the PICO/PICo and identifying research evidence (<i>P</i><.001). <strong>Conclusions:</strong> We assessed the students’ EBP skills based on how they documented following the EBP steps in the EBPsteps app, and our results showed variation in how well the students mastered the steps. “Apply” and “audit” were the most difficult EBP steps for the students to perform, a finding with implications for further development of both the app and educational instruction in EBP. The EBPsteps app is a new and relevant app for students to learn and practice EBP, and it can be used to assess students’ EBP skills objectively.
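Finally, a minimal sketch of the association analysis reported in the occupational therapy study above: a chi-square test of independence between correct PICO/PICo formulation and identification of research evidence. The 2×2 counts below are invented and scipy is assumed; the study's own data will differ.

```python
# Minimal sketch of a chi-square test of independence, as described in the
# EBPsteps study above. The counts are invented for illustration; scipy is
# assumed to be available.
from scipy.stats import chi2_contingency

# Rows: PICO/PICo elements formulated correctly vs incorrectly.
# Columns: relevant research evidence identified vs not identified.
table = [
    [110, 18],  # hypothetical counts, not the study's data
    [85, 27],
]

chi2, p, dof, expected = chi2_contingency(table)
print(f"chi-square={chi2:.2f}, df={dof}, P={p:.3f}")
```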