Author of the publication

Improving Audio-Visual Speech Recognition by Lip-Subword Correlation Based Visual Pre-training and Cross-Modal Fusion Encoder.

, , , , , , and . ICME, page 2627-2632. IEEE, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Application of hidden Markov models for recognition of a limited set of words in unconstrained speech., , and . ICASSP, page 254-257. IEEE, (1989)HMM clustering for connected word recognition., , , and . ICASSP, page 405-408. IEEE, (1989)Bayesian Learning of Gaussian Mixture Densities for Hidden Markov Models., and . HLT, Morgan Kaufmann, (1991)An SNR-incremental stochastic matching algorithm for noisy speech recognition., , and . IEEE Trans. Speech Audio Process., 9 (8): 866-873 (2001)A new approach to utterance verification based on neighborhood information in model space., and . IEEE Trans. Speech Audio Process., 11 (5): 425-434 (2003)TT+GT at TRECVID 2010 Workshop., , , , , , and . TRECVID, National Institute of Standards and Technology (NIST), (2010)Unsupervised adaptation using structural Bayes approach., and . ICASSP, page 793-796. IEEE, (1998)On stochastic feature and model compensation approaches to robust speech recognition.. Speech Commun., 25 (1-3): 29-47 (1998)An Information-Extraction Approach to Speech Processing: Analysis, Detection, Verification, and Recognition., and . Proc. IEEE, 101 (5): 1089-1115 (2013)Improving Deep Neural Network Based Speech Synthesis through Contextual Feature Parametrization and Multi-Task Learning., , , , and . J. Signal Process. Syst., 90 (7): 1025-1037 (2018)