Robust Vietnamese-English Clinical and Educational Medical Translation
: PiDA and LLM-generated Near-Misses accepted to Interspeech 2026.
: ViMedCSS accepted to LREC 2026.
Project Overview
We are a research group at VinUniversity, Vietnam focusing on improving machine translation in the medical domain. Our work addresses the critical need for accurate Vietnamese-English translation in both clinical and educational medical contexts.
Check out our published works:
- LLM-generated near-misses (Interspeech '26) – Contrastive training for robust code-switching speech recognition
- PiDA (Interspeech '26) – Phonetically-informed data augmentation for robust Vietnamese speech translation
- ViMedCSS dataset (LREC '26) – Vietnamese Medical Code-Switching Speech dataset & benchmark
- MedEV dataset (LREC-COLING '24) – Vietnamese-English parallel dataset for medical machine translation
Publications & Datasets
PiDA: Phonetically-Informed Data Augmentation for Robust Vietnamese Speech Translation
Authors: Giang Son Nguyen, Tung X. Nguyen, Hieu Minh Truong, Nhu Vo, Wray Buntine, Dung D. Le
Conference: Interspeech 2026
Contrastive Training with LLM-generated Near-Misses for Robust Code-Switching Speech Recognition
Authors: Tung X. Nguyen*, Hieu Minh Truong*, Giang Son Nguyen, Nhu Vo, Wray Buntine, Dung D. Le
* Equal contribution
Conference: Interspeech 2026
ViMedCSS: A Vietnamese Medical Code-Switching Speech Dataset & Benchmark
Authors: Tung X. Nguyen, Nhu Vo, Giang Son Nguyen, Duy Mai Hoang, Chien Dinh Huynh, Iñigo Jauregi Unanue, Massimo Piccardi, Wray Buntine, Dung D. Le
Conference: LREC 2026
Improving Vietnamese-English Medical Machine Translation
Authors: Nhu Vo, Dat Quoc Nguyen, Dung D. Le, Massimo Piccardi, Wray Buntine
Conference: LREC-COLING 2024
Team Members
Principal Investigators
Project PI
Project PI
Researchers
Research Assistant, MSc CS Student @ VinUni
Research Assistant @ VinUni
Research Intern, MSc CS Student @ VinUni
Research Intern @ VinUni
Collaborators
CHS, VinUni
CHS, VinUni
CHS, VinUni
University of Technology Sydney
University of Technology Sydney
Former Members
Others
Media Coverage
Prof. Dung's interview with Vietnamese media site Dan Tri.
View articleVietnamese-English Real Time Medical Speech Translation Prototype
We are developing a real-time Vietnamese-English speech translation prototype for medical applications.
Coming Soon
Guest Lecture on Introduction to Machine Translation at VinUni
Our researcher gave a guest lecture introducing students to the theoretical foundations and practical applications of MT systems.