Author of the publication

It's Never Too Late: Fusing Acoustic Information into Large Language Models for Automatic Speech Recognition.

ICLR, OpenReview.net, (2024)


Other publications of authors with the same name

Temporal filter design by minimum KL divergence criterion for robust speech recognition. ICASSP, page 7908-7912. IEEE, (2013)
Reducing the computational requirement of the orthogonal least squares algorithm. ICASSP (3), page 529-532. IEEE Computer Society, (1994)
Lasso environment model combination for robust speech recognition. ICASSP, page 4305-4308. IEEE, (2012)
Spoken Language Recognition with Relevance Feedback. ICASSP (4), page 861-864. IEEE, (2007)
Automatic Sports Video Genre Classification using Pseudo-2D-HMM. ICPR (4), page 778-781. IEEE Computer Society, (2006)
Phoneme lattice based texttiling towards multilingual story segmentation. INTERSPEECH, page 1305-1308. ISCA, (2010)
SEAME: a Mandarin-English code-switching speech corpus in south-east asia. INTERSPEECH, page 1986-1989. ISCA, (2010)
Vulnerability evaluation of speaker verification under voice conversion spoofing: the effect of text constraints. INTERSPEECH, page 950-954. ISCA, (2013)
High quality voice conversion using prosodic and high-resolution spectral features. CoRR, (2015)
Detecting synthetic speech using long term magnitude and phase information. ChinaSIP, page 611-615. IEEE, (2015)