Author of the publication

Multimodal Speech Synthesis Architecture for Unsupervised Speaker Adaptation.

, and . INTERSPEECH, page 2494-2498. ISCA, (2018)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Latent linguistic embedding for cross-lingual text-to-speech and voice conversion., and . CoRR, (2020)Deep learning based voice cloning framework for a unified system of text-to-speech and voice conversion.. Graduate University for Advanced Studies, Japan, (2020)Multimodal Speech Synthesis Architecture for Unsupervised Speaker Adaptation., and . INTERSPEECH, page 2494-2498. ISCA, (2018)Investigating Accuracy of Pitch-accent Annotations in Neural Network-based Speech Synthesis and Denoising Effects., , , and . INTERSPEECH, page 37-41. ISCA, (2018)NAUTILUS: A Versatile Voice Cloning System., and . IEEE ACM Trans. Audio Speech Lang. Process., (2020)LaughNet: synthesizing laughter utterances from waveform silhouettes and a single laughter example., and . CoRR, (2021)Scaling and Bias Codes for Modeling Speaker-Adaptive DNN-Based Speech Synthesis Systems., and . SLT, page 610-617. IEEE, (2018)Training Multi-Speaker Neural Text-to-Speech Systems Using Speaker-Imbalanced Speech Corpora., , , and . INTERSPEECH, page 1303-1307. ISCA, (2019)Preliminary study on using vector quantization latent spaces for TTS/VC systems with consistent performance., and . CoRR, (2021)A non-expert Kaldi recipe for Vietnamese Speech Recognition System., and . WLSI/OIAF4HLT@COLING, page 51-55. The COLING 2016 Organizing Committee, (2016)