Author of the publication

Hierarchical Timbre-Cadence Speaker Encoder for Zero-shot Speech Synthesis.

, , , , , , and . INTERSPEECH, page 4334-4338. ISCA, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Hierarchical Context-Aware Transformers for Non-Autoregressive Text to Speech., , , and . Interspeech, page 3610-3614. ISCA, (2021)GANSpeech: Adversarial Training for High-Fidelity Multi-Speaker Speech Synthesis., , , , and . Interspeech, page 2202-2206. ISCA, (2021)Latent Filling: Latent Space Data Augmentation for Zero-Shot Speech Synthesis., , , , , , and . ICASSP, page 11166-11170. IEEE, (2024)Avocodo: Generative Adversarial Network for Artifact-Free Vocoder., , , , , and . AAAI, page 12562-12570. AAAI Press, (2023)A Neural Text-to-Speech Model Utilizing Broadcast Data Mixed with Background Music., , , , and . ICASSP, page 6603-6607. IEEE, (2021)Speaking Speed Control of End-to-End Speech Synthesis Using Sentence-Level Conditioning., , , , , and . INTERSPEECH, page 4402-4406. ISCA, (2020)FastPitchFormant: Source-Filter Based Decomposed Modeling for Speech Synthesis., , , , and . Interspeech, page 116-120. ISCA, (2021)Hierarchical and Multi-Scale Variational Autoencoder for Diverse and Natural Non-Autoregressive Text-to-Speech., , , and . INTERSPEECH, page 813-817. ISCA, (2022)Hierarchical Timbre-Cadence Speaker Encoder for Zero-shot Speech Synthesis., , , , , , and . INTERSPEECH, page 4334-4338. ISCA, (2023)Mels-Tts : Multi-Emotion Multi-Lingual Multi-Speaker Text-To-Speech System Via Disentangled Style Tokens., , , , , , and . ICASSP, page 12682-12686. IEEE, (2024)