Author of the publication

Towards Spontaneous Style Modeling with Semi-supervised Pre-training for Conversational Text-to-Speech Synthesis.

INTERSPEECH, page 3377-3381. ISCA, (2023)


Other publications of authors with the same name

Towards Expressive Speaking Style Modelling with Hierarchical Context Information for Mandarin Speech Synthesis. ICASSP, page 7922-7926. IEEE, (2022)

Towards Multi-Scale Speaking Style Modelling with Hierarchical Context Information for Mandarin Speech Synthesis. INTERSPEECH, page 5523-5527. ISCA, (2022)

NRAdapt: Noise-Robust Adaptive Text to Speech Using Untranscribed Data. IJCNN, page 1-8. IEEE, (2024)

MuCodec: Ultra Low-Bitrate Music Codec. CoRR, (2024)

Unsupervised Multi-scale Expressive Speaking Style Modeling with Hierarchical Context Information for Audiobook Speech Synthesis. COLING, page 7193-7202. International Committee on Computational Linguistics, (2022)

GTN-Bailando: Genre Consistent long-Term 3D Dance Generation Based on Pre-Trained Genre Token Network. ICASSP, page 1-5. IEEE, (2023)

VoxInstruct: Expressive Human Instruction-to-Speech Generation with Unified Multilingual Codec Language Modelling. CoRR, (2024)

Improving Language Model-Based Zero-Shot Text-to-Speech Synthesis with Multi-Scale Acoustic Prompts. ICASSP, page 12662-12666. IEEE, (2024)

SongCreator: Lyrics-based Universal Song Generation. CoRR, (2024)

AdaMesh: Personalized Facial Expressions and Head Poses for Speech-Driven 3D Facial Animation. CoRR, (2023)