Author of the publication

Multimodal and Multiresolution Speech Recognition with Transformers.

, , , and . ACL, page 2381-2387. Association for Computational Linguistics, (2020)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Improving Noise Robustness of Automatic Speech Recognition via Parallel Data and Teacher-student Learning., , , , , , , and . ICASSP, page 6475-6479. IEEE, (2019)Multi-geometry Spatial Acoustic Modeling for Distant Speech Recognition., , , , and . ICASSP, page 6635-6639. IEEE, (2019)Fully Learnable Front-End for Multi-Channel Acoustic Modeling Using Semi-Supervised Learning., , , , and . ICASSP, page 6864-6868. IEEE, (2020)Clustering audio clips by context-free description and affective ratings., , and . EUSIPCO, page 472-476. IEEE, (2010)Acoustic stopwords for unstructured audio information retrieval., , , and . EUSIPCO, page 1277-1280. IEEE, (2010)Self-Supervised Learning with Cross-Modal Transformers for Emotion Recognition., , and . SLT, page 381-388. IEEE, (2021)Multi-Geometry Spatial Acoustic Modeling for Distant Speech Recognition., , , , and . CoRR, (2019)Frequency Domain Multi-channel Acoustic Modeling for Distant Speech Recognition., , , , and . ICASSP, page 6640-6644. IEEE, (2019)Enhancing Contrastive Learning with Temporal Cognizance for Audio-Visual Representation Generation., , , and . ICASSP, page 4728-4732. IEEE, (2022)Multi-Scale Compositional Constraints for Representation Learning on Videos., , , and . ICASSP, page 1-5. IEEE, (2023)