Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Leveraging Modality-Specific Representations for Audio-Visual Speech Recognition via Reinforcement Learning., , , , , and . AAAI, page 12607-12615. AAAI Press, (2023)Noise-Robust Speech Recognition With 10 Minutes Unparalleled In-Domain Data., , , , and . ICASSP, page 4298-4302. IEEE, (2022)SAFLFusionGait: Gait recognition network with separate attention and different granularity feature learnability fusion., , , , and . J. Vis. Commun. Image Represent., (2024)Enhancing Zero-shot Text-to-Speech Synthesis with Human Feedback., , , , , and . CoRR, (2024)SSR-Speech: Towards Stable, Safe and Robust Zero-shot Text-based Speech Editing and Synthesis., , , , , , , and . CoRR, (2024)A Neural State-Space Model Approach to Efficient Speech Separation., , , , , and . CoRR, (2023)UniS-MMC: Multimodal Classification via Unimodality-supervised Multimodal Contrastive Learning., , , , , and . ACL (Findings), page 659-672. Association for Computational Linguistics, (2023)Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation., , , , and . ICASSP, page 1-5. IEEE, (2023)Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition., , , , and . ICASSP, page 1-5. IEEE, (2023)MIR-GAN: Refining Frame-Level Modality-Invariant Representations with Adversarial Network for Audio-Visual Speech Recognition., , , , and . ACL (1), page 11610-11625. Association for Computational Linguistics, (2023)