@flint63

Improved Performance of Aurora 4 Using HTK and Unsupervised MLLR Adaptation

, and . Proceedings of Interspeech 2004---ICSLP: 8th International Conference on Spoken Language Processing, Jeju Island, Korea, page 161-164. (2004)

Abstract

The introduction of Aurora 4 tasks provides a standard database and methodology for comparing the effectiveness of different robust algorithms on LVCSR. One important issue on Aurora 4 tasks is the computation time involved in evaluating different test conditions. In this paper we show that by employing HTK as the recognition frontend and backend on Aurora 4 tasks with the use of cepstral mean subtraction, 14 percent relative improvement is achieved on the baseline clean train tasks at a 82.5 percent time reduction in training time and 40 percent time reduction on decoding. Furthermore, we found that optimizing the model complexity can increase the recognition performance (in both computation time and accuracy). Accuracy can be further improved with the use of unsupervised MLLR adaptation on one or multiple sentences. The adaptation results show that most of the gain from adaptation comes from adapting to the environment instead of to the speaker. With the use of adaptation,the error rate is reduced from the baseline result of 69.6 percent to 40 percent.

Links and resources

Tags