Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Mega-TTS 2: Zero-Shot Text-to-Speech with Arbitrary Length Speech Prompts., , , , , , , , , and 1 other author(s). CoRR, (2023)Make-An-Audio: Text-To-Audio Generation with Prompt-Enhanced Diffusion Models., , , , , , , , , and . ICML, volume 202 of Proceedings of Machine Learning Research, page 13916-13932. PMLR, (2023)Make-A-Voice: Revisiting Voice Large Language Models as Scalable Multilingual and Multitask Learners., , , , , , , , , and 4 other author(s). ACL (1), page 10929-10942. Association for Computational Linguistics, (2024)MulliVC: Multi-lingual Voice Conversion With Cycle Consistency., , , , , , , , and . CoRR, (2024)RMSSinger: Realistic-Music-Score based Singing Voice Synthesis., , , , , , and . ACL (Findings), page 236-248. Association for Computational Linguistics, (2023)CLAPSpeech: Learning Prosody from Text Context with Contrastive Language-Audio Pre-Training., , , , , , , and . ACL (1), page 9317-9331. Association for Computational Linguistics, (2023)DopplerBAS: Binaural Audio Synthesis Addressing Doppler Effect., , , , , , and . ACL (Findings), page 11905-11912. Association for Computational Linguistics, (2023)Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation., , , , , , , , , and . CoRR, (2023)Mega-TTS 2: Boosting Prompting Mechanisms for Zero-Shot Speech Synthesis., , , , , , , , , and 3 other author(s). ICLR, OpenReview.net, (2024)Soft Hierarchical Graph Recurrent Networks for Many-Agent Partially Observable Environments., , , and . CoRR, (2021)