Author of the publication

Multimodal and Multiresolution Speech Recognition with Transformers.

, , , and . ACL, page 2381-2387. Association for Computational Linguistics, (2020)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Improving Noise Robustness of Automatic Speech Recognition via Parallel Data and Teacher-student Learning., , , , , , , and . ICASSP, page 6475-6479. IEEE, (2019)Multi-geometry Spatial Acoustic Modeling for Distant Speech Recognition., , , , and . ICASSP, page 6635-6639. IEEE, (2019)Fully Learnable Front-End for Multi-Channel Acoustic Modeling Using Semi-Supervised Learning., , , , and . ICASSP, page 6864-6868. IEEE, (2020)Clustering audio clips by context-free description and affective ratings., , and . EUSIPCO, page 472-476. IEEE, (2010)Experiments in Automatic Genre Classification of Full-length Music Tracks using Audio Activity Rate., and . MMSP, page 98-102. IEEE, (2007)Analysis of Audio Clustering using Word Descriptions., and . ICASSP (2), page 769-772. IEEE, (2007)Multi-channel Acoustic Modeling using Mixed Bitrate OPUS Compression., , and . CoRR, (2020)Experiments in context-independent recognition of non-lexical 'yes' or 'no' responses., , and . ICASSP, page 5696-5699. IEEE, (2011)Enhancing Contrastive Learning with Temporal Cognizance for Audio-Visual Representation Generation., , , and . ICASSP, page 4728-4732. IEEE, (2022)Multi-Scale Compositional Constraints for Representation Learning on Videos., , , and . ICASSP, page 1-5. IEEE, (2023)