%0 Journal Article
%@ 2369-3762
%I JMIR Publications
%V 11
%N 
%P e58897
%T Performance of ChatGPT-4 on Taiwanese Traditional Chinese Medicine Licensing Examinations: Cross-Sectional Study
%A Tseng,Liang-Wei
%A Lu,Yi-Chin
%A Tseng,Liang-Chi
%A Chen,Yu-Chun
%A Chen,Hsing-Yu
%K artificial intelligence
%K AI language understanding tools
%K ChatGPT
%K natural language processing
%K machine learning
%K Chinese medicine license exam
%K Chinese medical licensing examination
%K medical education
%K traditional Chinese medicine
%K large language model
%D 2025
%7 19.3.2025
%9 
%J JMIR Med Educ
%G English
%X Background: The integration of artificial intelligence (AI), notably ChatGPT, into medical education, has shown promising results in various medical fields. Nevertheless, its efficacy in traditional Chinese medicine (TCM) examinations remains understudied. Objective: This study aims to (1) assess the performance of ChatGPT on the TCM licensing examination in Taiwan and (2) evaluate the model’s explainability in answering TCM-related questions to determine its suitability as a TCM learning tool. Methods: We used the GPT-4 model to respond to 480 questions from the 2022 TCM licensing examination. This study compared the performance of the model against that of licensed TCM doctors using 2 approaches, namely direct answer selection and provision of explanations before answer selection. The accuracy and consistency of AI-generated responses were analyzed. Moreover, a breakdown of question characteristics was performed based on the cognitive level, depth of knowledge, types of questions, vignette style, and polarity of questions. Results: ChatGPT achieved an overall accuracy of 43.9%, which was lower than that of 2 human participants (70% and 78.4%). The analysis did not reveal a significant correlation between the accuracy of the model and the characteristics of the questions. An in-depth examination indicated that errors predominantly resulted from a misunderstanding of TCM concepts (55.3%), emphasizing the limitations of the model with regard to its TCM knowledge base and reasoning capability. Conclusions: Although ChatGPT shows promise as an educational tool, its current performance on TCM licensing examinations is lacking. This highlights the need for enhancing AI models with specialized TCM training and suggests a cautious approach to utilizing AI for TCM education. Future research should focus on model improvement and the development of tailored educational applications to support TCM learning. 
%R 10.2196/58897
%U https://mededu.jmir.org/2025/1/e58897
%U https://doi.org/10.2196/58897