Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

AntCritic: Argument Mining for Free-Form and Visually-Rich Financial Comments., , , , , and . LREC/COLING, page 1306-1317. ELRA and ICCL, (2024)Wav2SQL: Direct Generalizable Speech-To-SQL Parsing., , , , , , and . ACL (Findings), page 4230-4242. Association for Computational Linguistics, (2024)MEDIC: Zero-shot Music Editing with Disentangled Inversion Control., , , , , and . CoRR, (2024)CosyVoice 2: Scalable Streaming Speech Synthesis with Large Language Models., , , , , , , , , and 9 other author(s). CoRR, (2024)TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation., , , , , , and . CoRR, (2022)ViT-TTS: Visual Text-to-Speech with Scalable Diffusion Transformer., , , , , , , and . EMNLP, page 15957-15969. Association for Computational Linguistics, (2023)Both Ears Wide Open: Towards Language-Driven Spatial Audio Generation., , , , , , , and . CoRR, (2024)AudioLCM: Efficient and High-Quality Text-to-Audio Generation with Minimal Inference Steps., , , , , , , and . ACM Multimedia, page 7008-7017. ACM, (2024)TranSpeech: Speech-to-Speech Translation With Bilateral Perturbation., , , , , , and . ICLR, OpenReview.net, (2023)Wav2SQL: Direct Generalizable Speech-To-SQL Parsing., , , , , , and . CoRR, (2023)