Author of the publication

Efficient likelihood computation in multi-stream HMM based audio-visual speech recognition.

, , , and . INTERSPEECH, page 2297-2300. ISCA, (2004)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

The IBM Rich Transcription 2007 Speech-to-Text Systems for Lecture Meetings., , , , and . CLEAR, volume 4625 of Lecture Notes in Computer Science, page 429-441. Springer, (2007)The IBM RT06s Evaluation System for Speech Activity Detection in CHIL Seminars., , , and . MLMI, volume 4299 of Lecture Notes in Computer Science, page 323-335. Springer, (2006)Multistage information fusion for audio-visual speech recognition., , , , and . ICME, page 1651-1654. IEEE Computer Society, (2004)Self-critical Sequence Training for Image Captioning., , , , and . CoRR, (2016)Towards a domain-independent ASR-confidence classifier., , and . ICASSP, page 4929-4932. IEEE, (2012)The IBM speech-to-speech translation system for smartphone: Improvements for resource-constrained tasks., , , , , , , , , and 2 other author(s). Comput. Speech Lang., 27 (2): 592-618 (2013)Acoustic-Similarity Based Technique to Improve Concept Recognition., , , and . INTERSPEECH, page 1013-1016. ISCA, (2011)Detection, diarization, and transcription of far-field lecture speech., , , , and . INTERSPEECH, page 2161-2164. ISCA, (2007)An Extensible Language Interfacefor Robot Manipulation., , , , and . AGI, volume 7716 of Lecture Notes in Computer Science, page 21-30. Springer, (2012)A real-time prototype for small-vocabulary audio-visual ASR., , , , , and . ICME, page 469-472. IEEE Computer Society, (2003)