Vietnamese Speech Synthesis with End-to-End Model and Text Normalization

Published in 7th NAFOSTED Conference on Information and Computer Science, 2020

Authors: Do Tri Nhan et al. (2020). Vietnamese Speech Synthesis with End-to-End Model and Text Normalization. 7th NAFOSTED Conference on Information and Computer Science 2020.

This paper presents an end-to-end Vietnamese text-to-speech (TTS) system that integrates a text normalization pipeline (vinorm) with Tacotron2 and WaveGlow, and demonstrates improved naturalness of Vietnamese prosody.