Author of the publication


Multimodal and Multiresolution Speech Recognition with Transformers.

G. Paraskevopoulos, S. Parthasarathy, A. Khare, and S. Sundaram. ACL, page 2381-2387. Association for Computational Linguistics, (2020)


Other publications of authors with the same name

Improving Noise Robustness of Automatic Speech Recognition via Parallel Data and Teacher-student Learning. L. Mosner, M. Wu, A. Raju, S. Parthasarathi, K. Kumatani, S. Sundaram, R. Maas, and B. Hoffmeister. ICASSP, page 6475-6479. IEEE, (2019)

Multi-geometry Spatial Acoustic Modeling for Distant Speech Recognition. K. Kumatani, M. Wu, S. Sundaram, N. Ström, and B. Hoffmeister. ICASSP, page 6635-6639. IEEE, (2019)

Fully Learnable Front-End for Multi-Channel Acoustic Modeling Using Semi-Supervised Learning. S. Wager, A. Khare, M. Wu, K. Kumatani, and S. Sundaram. ICASSP, page 6864-6868. IEEE, (2020)

Clustering audio clips by context-free description and affective ratings. S. Sundaram, R. Schleicher, and J. Seebode. EUSIPCO, page 472-476. IEEE, (2010)

Acoustic stopwords for unstructured audio information retrieval. S. Kim, S. Sundaram, P. Georgiou, and S. Narayanan. EUSIPCO, page 1277-1280. IEEE, (2010)

Self-Supervised Learning with Cross-Modal Transformers for Emotion Recognition. A. Khare, S. Parthasarathy, and S. Sundaram. SLT, page 381-388. IEEE, (2021)

Multi-Geometry Spatial Acoustic Modeling for Distant Speech Recognition. K. Kumatani, M. Wu, S. Sundaram, N. Strom, and B. Hoffmeister. CoRR, (2019)

Frequency Domain Multi-channel Acoustic Modeling for Distant Speech Recognition. M. Wu, K. Kumatani, S. Sundaram, N. Ström, and B. Hoffmeister. ICASSP, page 6640-6644. IEEE, (2019)

Enhancing Contrastive Learning with Temporal Cognizance for Audio-Visual Representation Generation. C. Lavania, S. Sundaram, S. Srinivasan, and K. Kirchhoff. ICASSP, page 4728-4732. IEEE, (2022)

Multi-Scale Compositional Constraints for Representation Learning on Videos. G. Paraskevopoulos, C. Lavania, L. Chum, and S. Sundaram. ICASSP, page 1-5. IEEE, (2023)

BibSonomy

Disambiguation of "Sundaram, Shiva"
