Author of the publication

Av-Sepformer: Cross-Attention Sepformer for Audio-Visual Target Speaker Extraction.

, , , , , , , , , and . ICASSP, page 1-5. IEEE, (2023)

Other publications of authors with the same name

Data Augmentation For Children's Speech Recognition - The "Ethiopian" System For The SLT 2021 Children Speech Recognition Challenge., , , , , , and . CoRR, (2020)

Combining KNN algorithm and other classifiers., and . IEEE ICCI, page 800-805. IEEE Computer Society, (2010)

Unified Keyword Spotting and Audio Tagging on Mobile Devices with Transformers., , , , and . ICASSP, page 1-5. IEEE, (2023)

Improved rule based rough set approach for target recognition., , , and . GrC, page 550-555. IEEE, (2005)

Incremental Target Recognition Algorithm Based on Improved Discernibility Matrix., , , and . FSKD (1), volume 3613 of Lecture Notes in Computer Science, page 1246-1255. Springer, (2005)

speechocean762: An Open-Source Non-Native English Speech Corpus for Pronunciation Assessment., , , , , , , , and . Interspeech, page 3710-3714. ISCA, (2021)

Enhancing Automated Audio Captioning via Large Language Models with Optimized Audio Encoding., , , , , , , and . CoRR, (2024)

Optimal diagonal precoder for multiantenna communication systems., , and . IEEE Trans. Signal Process., 53 (6): 2089-2100 (2005)

Studies on classification models using decision boundaries., and . IEEE ICCI, page 287-294. IEEE Computer Society, (2009)

Pseudo Strong Labels for Large Scale Weakly Supervised Audio Tagging., , , , and . ICASSP, page 336-340. IEEE, (2022)