VinUniversity Logo

Robust Vietnamese-English Clinical and Educational Medical Translation

News

: PiDA and LLM-generated Near-Misses accepted to Interspeech 2026.

: ViMedCSS accepted to LREC 2026.

Project Overview

We are a research group at VinUniversity, Vietnam focusing on improving machine translation in the medical domain. Our work addresses the critical need for accurate Vietnamese-English translation in both clinical and educational medical contexts.

Check out our published works:

  • LLM-generated near-misses (Interspeech '26) – Contrastive training for robust code-switching speech recognition
  • PiDA (Interspeech '26) – Phonetically-informed data augmentation for robust Vietnamese speech translation
  • ViMedCSS dataset (LREC '26) – Vietnamese Medical Code-Switching Speech dataset & benchmark
  • MedEV dataset (LREC-COLING '24) – Vietnamese-English parallel dataset for medical machine translation

Publications & Datasets

PiDA: Phonetically-Informed Data Augmentation for Robust Vietnamese Speech Translation

Authors: Giang Son Nguyen, Tung X. Nguyen, Hieu Minh Truong, Nhu Vo, Wray Buntine, Dung D. Le
Conference: Interspeech 2026

arXiv

Contrastive Training with LLM-generated Near-Misses for Robust Code-Switching Speech Recognition

Authors: Tung X. Nguyen*, Hieu Minh Truong*, Giang Son Nguyen, Nhu Vo, Wray Buntine, Dung D. Le
* Equal contribution
Conference: Interspeech 2026

arXiv

ViMedCSS: A Vietnamese Medical Code-Switching Speech Dataset & Benchmark

Authors: Tung X. Nguyen, Nhu Vo, Giang Son Nguyen, Duy Mai Hoang, Chien Dinh Huynh, Iñigo Jauregi Unanue, Massimo Piccardi, Wray Buntine, Dung D. Le
Conference: LREC 2026

Improving Vietnamese-English Medical Machine Translation

Authors: Nhu Vo, Dat Quoc Nguyen, Dung D. Le, Massimo Piccardi, Wray Buntine
Conference: LREC-COLING 2024

Team Members

Principal Investigators

Researchers

Nhu Vo

Nhu Vo, MSc

PhD Student @ VinUni & UTS. Prev: MSc CS @ HCMUS

Vingroup Scholar

Researcher Name

Giang Son Nguyen, MSc

Research Assistant @ VinUni. Prev: MSc DS @ NTUsg

Vingroup Scholar

Nu-Uyen-Phuong Le

Nu-Uyen-Phuong Le, MSc

Research Assistant @ VinUni. Prev: MSc DS @ UQ

Vingroup Scholar

Tung X. Nguyen

Tung X. Nguyen

Research Assistant, MSc CS Student @ VinUni

Phuong Thi Kim Nguyen

Phuong Thi Kim Nguyen

Research Assistant @ VinUni

Hieu Minh Truong

Hieu Minh Truong

Research Intern, MSc CS Student @ VinUni

Nhi Ngoc-Yen Nguyen

Nhi Ngoc-Yen Nguyen

Research Intern @ VinUni

Collaborators

Duy Mai Hoang

Duy Mai Hoang, MSc

CHS, VinUni

Prof. Massimo Piccardi

Prof. Massimo Piccardi

University of Technology Sydney

Dr. Inigo Jauregi Unanue

Dr. Inigo Jauregi Unanue

University of Technology Sydney

Former Members

Minh Binh Vu

Minh Binh Vu

BSc CS @ VinUni. Now: MSc Student @ UC Berkeley

Vingroup Scholar

Others

Media Coverage

Prof. Dung's interview with Dan Tri

Prof. Dung's interview with Vietnamese media site Dan Tri.

View article

Vietnamese-English Real Time Medical Speech Translation Prototype

We are developing a real-time Vietnamese-English speech translation prototype for medical applications.

Coming Soon

Guest Lecture on Introduction to Machine Translation at VinUni

Our researcher gave a guest lecture introducing students to the theoretical foundations and practical applications of MT systems.