Author of the publication

Target Confusion in End-to-end Speaker Extraction: Analysis and Approaches.

, , , , and . INTERSPEECH, page 5333-5337. ISCA, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Omnidirectional Motion Control Method of Quadruped Robot Based on 3D-CPG Oscillator Group., , , , , and . CLAWAR, volume 530 of Lecture Notes in Networks and Systems, page 301-312. Springer, (2022)InstructTTS: Modelling Expressive TTS in Discrete Latent Space with Natural Language Style Prompt., , , , , , and . CoRR, (2023)DPM-TSE: A Diffusion Probabilistic Model for Target Sound Extraction., , , , , and . CoRR, (2023)RaDur: A Reference-aware and Duration-robust Network for Target Sound Detection., , , , and . INTERSPEECH, page 1511-1515. ISCA, (2022)Improving Target Sound Extraction with Timestamp Information., , , , and . INTERSPEECH, page 1526-1530. ISCA, (2022)PromptTTS 2: Describing and Generating Voices with Text Prompt., , , , , , , , , and 5 other author(s). CoRR, (2023)InstructSpeech: Following Speech Editing Instructions via Large Language Models., , , , , , , , , and 1 other author(s). ICML, OpenReview.net, (2024)Diffsound: Discrete Diffusion Model for Text-to-sound Generation., , , , , , and . CoRR, (2022)Improving the Performance of Automated Audio Captioning via Integrating the Acoustic and Semantic Information., , , and . DCASE, page 40-44. (2021)YOLOv3 with Asymmetric Intersection over Union Based Loss Function for Human Detection., , , , , and . ICMLSC, page 70-76. ACM, (2021)