FastSpeechStyle: Fast, Emotion-Controllable, High-Quality Speech Synthesis

Published in International Journal of Asian Language Processing, 2022

Authors: Do Tri Nhan et al. (2022). FastSpeechStyle: Fast, Emotion-Controllable, High-Quality Speech Synthesis. International Journal of Asian Language Processing.

An emotion-controllable TTS system extending FastSpeech2 with style transfer capabilities, enabling fast and high-quality emotional speech synthesis for Vietnamese.