Redirecting to original paper in 30 seconds...

Click below to go immediately or wait for automatic redirect

arxiv_ml 90% Match research paper NLP researchers,speech processing engineers,medical professionals,healthcare administrators,linguists 1 day ago

MultiMed-ST: Large-scale Many-to-many Multilingual Medical Speech Translation

speech-audio › speech-recognition
📄 Abstract

Abstract: Multilingual speech translation (ST) and machine translation (MT) in the medical domain enhances patient care by enabling efficient communication across language barriers, alleviating specialized workforce shortages, and facilitating improved diagnosis and treatment, particularly during pandemics. In this work, we present the first systematic study on medical ST, to our best knowledge, by releasing MultiMed-ST, a large-scale ST dataset for the medical domain, spanning all translation directions in five languages: Vietnamese, English, German, French, and Simplified/Traditional Chinese, together with the models. With 290,000 samples, this is the largest medical MT dataset and the largest many-to-many multilingual ST among all domains. Secondly, we present the most comprehensive ST analysis in the field's history, to our best knowledge, including: empirical baselines, bilingual-multilingual comparative study, end-to-end vs. cascaded comparative study, task-specific vs. multi-task sequence-to-sequence comparative study, code-switch analysis, and quantitative-qualitative error analysis. All code, data, and models are available online: https://github.com/leduckhai/MultiMed-ST
Authors (13)
Khai Le-Duc
Tuyen Tran
Bach Phan Tat
Nguyen Kim Hai Bui
Quan Dang
Hung-Phong Tran
+7 more
Submitted
April 4, 2025
arXiv Category
cs.CL
arXiv PDF

Key Contributions

This paper presents MultiMed-ST, the first large-scale, many-to-many multilingual medical speech translation dataset and models, covering five languages (Vietnamese, English, German, French, Chinese). It also provides the most comprehensive ST analysis to date, comparing empirical baselines, bilingual vs. multilingual, and end-to-end vs. cascaded approaches. This work aims to enhance communication in the medical domain, especially during crises like pandemics.

Business Value

Significantly improves global healthcare accessibility by breaking down language barriers, leading to better patient outcomes, reduced medical errors, and more efficient healthcare delivery worldwide.