Published on in Vol 11 (2025)

Preprints (earlier versions) of this paper are available at https://preprints.jmir.org/preprint/70420, first published .
Enhancing AI-Driven Medical Translations: Considerations for Language Concordance

Enhancing AI-Driven Medical Translations: Considerations for Language Concordance

Enhancing AI-Driven Medical Translations: Considerations for Language Concordance

Authors of this article:

Stephanie Quon1 Author Orcid Image ;   Sarah Zhou2 Author Orcid Image

1Faculty of Medicine, University of British Columbia, 2194 Health Sciences Mall, Vancouver, BC, Canada

2Faculty of Science, University of British Columbia, Vancouver, BC, Canada

Corresponding Author:

Stephanie Quon, BASc



We commend the recent publication by Dzuali et al [Dzuali F, Seiger K, Novoa R, et al. ChatGPT may improve access to language-concordant care for patients with non-English language preferences. JMIR Med Educ. Dec 10, 2024;10:e51435. [CrossRef] [Medline]1], which explored the application of ChatGPT for translating patient education materials into multiple languages. This important study highlights a critical area where artificial intelligence (AI) can potentially bridge gaps in language-concordant care. To further this research, we would like to raise several points to enrich the discussion and understanding of the findings.

The study demonstrates that while ChatGPT provides clinically usable translations for Spanish and Russian, its performance with Mandarin is suboptimal. This inconsistency raises important questions regarding the linguistic complexities and structural differences between English and Mandarin, which may hinder the accuracy and appropriateness of translations. Previous research has shown that the nuanced sentence structures and specialized terminology in Mandarin pose challenges for AI models such as ChatGPT, suggesting the need for more refined approaches when using AI for translation in linguistically distinct languages [Jiao W, et al. Is ChatGPT a good translator? A preliminary study. arXiv. Preprint posted online on Oct 1, 2023. [Medline]2].

Being familiar with the Mandarin language, we have firsthand experience with the challenges that come with translating between languages with distinct linguistic structures. Mandarin, with its nuanced sentence structures and specialized terminology, presents difficulties for large language models such as ChatGPT. These challenges are compounded by differences in grammar, idiomatic expressions, and cultural contexts, which may lead to inaccuracies and misunderstandings in translations. Therefore, this study could provide additional insight into how cultural context influences translation quality. Mandarin, for example, involves not only linguistic precision but also an understanding of cultural nuances that could affect comprehension [Duff P, et al. Learning Chinese: Linguistic, Sociocultural, and Narrative Perspectives. Vol 5. Walter de Gruyter; 2013. 3]. Future studies could explore how AI models such as ChatGPT are trained to account for these contextual factors to ensure culturally appropriate translations.

Another area for potential exploration in this study is the testing of alternative prompts and the impact they may have on translation quality. While the study focuses on a single translation prompt—“Translate this into <target language>”—the variability of AI-generated translations could be better evaluated through a variety of prompts. Utilizing multiple prompts could reveal a broader range of performance outcomes, especially for linguistically complex languages such as Mandarin and Russian. Other studies have shown that different AI prompts can produce vastly different results [Oppenlaender J, Linder R, Silvennoinen J. Prompting AI art: an investigation into the creative skill of prompt engineering. Int J Hum Comput. 2024:1-23. [CrossRef]4].

Lastly, the study heavily relies on the involvement of board-certified dermatologists for posttranslation review, which is applicable to the context of dermatology-related information, but may not fully address the extent of errors and misinformation. While human oversight is essential, the study could benefit from a more robust evaluation of how different levels of human intervention—such as linguistic experts or specialists in medical translation—might improve translation accuracy [Topol EJ. High-performance medicine: the convergence of human and artificial intelligence. Nat Med. Jan 2019;25(1):44-56. [CrossRef] [Medline]5]. Future research should explore how different combinations of AI-generated translations and human review from varied sources could optimize clinical usability.

Overall, while ChatGPT shows promise for improving access to language-concordant patient education, further refinement and validation are required. This study is an important milestone in starting this discussion surrounding AI-translation in medical contexts, and we commend the authors for their valuable contribution to advancing the field. They clearly demonstrate a meticulous approach, thoughtful analysis, and commitment to improving patient care through innovative solutions.

Conflicts of Interest

None declared.

  1. Dzuali F, Seiger K, Novoa R, et al. ChatGPT may improve access to language-concordant care for patients with non-English language preferences. JMIR Med Educ. Dec 10, 2024;10:e51435. [CrossRef] [Medline]
  2. Jiao W, et al. Is ChatGPT a good translator? A preliminary study. arXiv. Preprint posted online on Oct 1, 2023. [Medline]
  3. Duff P, et al. Learning Chinese: Linguistic, Sociocultural, and Narrative Perspectives. Vol 5. Walter de Gruyter; 2013.
  4. Oppenlaender J, Linder R, Silvennoinen J. Prompting AI art: an investigation into the creative skill of prompt engineering. Int J Hum Comput. 2024:1-23. [CrossRef]
  5. Topol EJ. High-performance medicine: the convergence of human and artificial intelligence. Nat Med. Jan 2019;25(1):44-56. [CrossRef] [Medline]


AI: artificial intelligence


Edited by Tiffany Leung; This is a non–peer-reviewed article. submitted 20.12.24; accepted 27.01.25; published 11.04.25.

Copyright

© Stephanie Quon, Sarah Zhou. Originally published in JMIR Medical Education (https://mededu.jmir.org), 11.4.2025.

This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in JMIR Medical Education, is properly cited. The complete bibliographic information, a link to the original publication on https://mededu.jmir.org/, as well as this copyright and license information must be included.