Author of the publication

Train Long and Test Long: Leveraging Full Document Contexts in Speech Processing.

ICASSP, pages 13066-13070. IEEE, (2024)


Other publications of authors with the same name

Error type classification and word accuracy estimation using alignment features from word confusion network. ICASSP, pages 4925-4928. IEEE, (2012)
Discriminative recognition rate estimation for N-best list and its application to N-best rescoring. ICASSP, pages 6832-6836. IEEE, (2013)
Espnet-Summ: Introducing a Novel Large Dataset, Toolkit, and a Cross-Corpora Evaluation of Speech Summarization Systems. ASRU, pages 1-8. IEEE, (2023)
Novel two-pass search strategy using time-asynchronous shortest-first second-pass beam search. INTERSPEECH, pages 290-293. ISCA, (2000)
BLSTM-Based Confidence Estimation for End-to-End Speech Recognition. ICASSP, pages 6383-6387. IEEE, (2021)
Exploiting imbalanced textual and acoustic data for training prosodically-enhanced RNNLMs. APSIPA, pages 618-621. IEEE, (2017)
Single Channel Target Speaker Extraction and Recognition with Speaker Beam. ICASSP, pages 5554-5558. IEEE, (2018)
Automatic Vocabulary Adaptation Based on Semantic Similarity and Speech Recognition Confidence Measure. INTERSPEECH, pages 2310-2313. ISCA, (2012)
Predicting Speech Intelligibility of Enhanced Speech Using Phone Accuracy of DNN-Based ASR System. INTERSPEECH, pages 4275-4279. ISCA, (2019)
Multimodal SpeakerBeam: Single Channel Target Speech Extraction with Audio-Visual Speaker Clues. INTERSPEECH, pages 2718-2722. ISCA, (2019)