Author of the publication

Context-Aware Coherent Speaking Style Prediction with Hierarchical Transformers for Audiobook Speech Synthesis.

, , , , , and . ICASSP, page 1-5. IEEE, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Towards Expressive Speaking Style Modelling with Hierarchical Context Information for Mandarin Speech Synthesis., , , , , and . ICASSP, page 7922-7926. IEEE, (2022)Towards Multi-Scale Speaking Style Modelling with Hierarchical Context Information for Mandarin Speech Synthesis., , , , , , and . INTERSPEECH, page 5523-5527. ISCA, (2022)GTN-Bailando: Genre Consistent long-Term 3D Dance Generation Based on Pre-Trained Genre Token Network., , , , , , , , and . ICASSP, page 1-5. IEEE, (2023)Unsupervised Multi-scale Expressive Speaking Style Modeling with Hierarchical Context Information for Audiobook Speech Synthesis., , , , , and . COLING, page 7193-7202. International Committee on Computational Linguistics, (2022)AdaMesh: Personalized Facial Expressions and Head Poses for Speech-Driven 3D Facial Animation., , , , , , and . CoRR, (2023)The THU-HCSI Multi-Speaker Multi-Lingual Few-Shot Voice Cloning System for LIMMITS'24 Challenge., , , , and . CoRR, (2024)SimCalib: Graph Neural Network Calibration Based on Similarity between Nodes., , , , , , and . AAAI, page 15267-15275. AAAI Press, (2024)MSStyleTTS: Multi-Scale Style Modeling With Hierarchical Context Information for Expressive Speech Synthesis., , , , , , and . IEEE ACM Trans. Audio Speech Lang. Process., (2023)MRC-LSTM: A Hybrid Approach of Multi-scale Residual CNN and LSTM to Predict Bitcoin Price., , , and . IJCNN, page 1-8. IEEE, (2021)Towards Improving the Expressiveness of Singing Voice Synthesis with BERT Derived Semantic Information., , , , , , , and . INTERSPEECH, page 4292-4296. ISCA, (2022)