AN ANALYSIS OF SPEECH RECOGNITION PERFORMANCE BASED UPON NETWORK LAYERS AND TRANSFER FUNCTIONS

K. Kumar.
International Journal of Computer Science, Engineering and Applications (IJCSEA), 1 (3): 1-10 (June 2011)


Abstract

Speech is the most natural way of exchanging information. It provides an efficient means of man-machine communication through speech interfacing. Speech interfacing involves speech synthesis and speech recognition. Speech recognition allows a computer to identify the words that a person speaks into a microphone or telephone. The two main components normally used in speech recognition are a signal processing component at the front end and a pattern matching component at the back end. In this paper, a setup that uses Mel frequency cepstral coefficients at the front end and artificial neural networks at the back end has been developed to perform experiments analyzing speech recognition performance. Various experiments have been performed by varying the number of layers and the type of network transfer function, which helps in deciding the network architecture to be used for acoustic modelling at the back end.
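
The kind of experiment grid the abstract describes (varying the number of layers and the transfer function of a feed-forward network fed with cepstral features) might be sketched as follows. This is only an illustration, not the paper's setup: the transfer function names (tansig, logsig, purelin), the 13-value feature frame, the layer sizes, the 10 output classes and the random untrained weights are all assumptions chosen for the example.

```python
import math
import random

# Candidate transfer functions. The names are assumed for illustration;
# the abstract does not say which functions the paper compared.
TRANSFER = {
    "tansig": math.tanh,                             # hyperbolic tangent sigmoid
    "logsig": lambda x: 1.0 / (1.0 + math.exp(-x)),  # logistic sigmoid
    "purelin": lambda x: x,                          # linear (identity)
}

def build_network(layer_sizes, seed=0):
    """Create one (weights, biases) pair per layer with small random weights."""
    rng = random.Random(seed)
    layers = []
    for n_in, n_out in zip(layer_sizes, layer_sizes[1:]):
        weights = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)]
                   for _ in range(n_out)]
        biases = [0.0] * n_out
        layers.append((weights, biases))
    return layers

def forward(layers, features, transfer="tansig"):
    """Propagate one feature frame through every layer of the network."""
    f = TRANSFER[transfer]
    activation = list(features)
    for weights, biases in layers:
        activation = [
            f(sum(w * a for w, a in zip(row, activation)) + b)
            for row, b in zip(weights, biases)
        ]
    return activation

if __name__ == "__main__":
    # Placeholder frame of 13 values standing in for MFCC features (hypothetical).
    mfcc_frame = [0.1 * i for i in range(13)]
    # Grid over hidden-layer counts and transfer functions (sizes are assumed).
    for hidden in ([20], [20, 20]):
        net = build_network([13, *hidden, 10])
        for name in TRANSFER:
            scores = forward(net, mfcc_frame, name)
            print(len(hidden), "hidden layer(s),", name, "->", len(scores), "outputs")
```

In a real comparison each configuration would be trained on labelled speech data and ranked by recognition accuracy; the sketch only shows how the two varied factors enter the network definition.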

BibTeX key: noauthororeditor
entry type: article
year: 2011
month: June
journal: International Journal of Computer Science, Engineering and Applications (IJCSEA)
number: 3
pages: 1-10
volume: 1
Document: http://airccse.org/journal/ijcsea/papers/0611csea02.pdf
