FastSpeechStyle: Fast, Emotion-Controllable, High-Quality Speech Synthesis
Published in International Journal of Asian Language Processing, 2022
Authors: Do Tri Nhan et al. (2022). FastSpeechStyle: Fast, Emotion-Controllable, High-Quality Speech Synthesis. International Journal of Asian Language Processing.
An emotion-controllable TTS system extending FastSpeech2 with style transfer capabilities, enabling fast and high-quality emotional speech synthesis for Vietnamese.
