Article,

DNN-based performance measures for predicting error rates in automatic speech recognition and optimizing hearing aid parameters

A. Martinez, L. Gerlach, G. Payá Vayá, H. Hermansky, J. Ooster, and B. Meyer.
Speech Communication (2018)
DOI: https://doi.org/10.1016/j.specom.2018.11.006

Abstract

In several applications of machine listening, predicting how well an automatic speech recognition system will perform before the actual decoding enables the system to adapt to unseen acoustic characteristics dynamically. Feedback about speech quality, for instance, could allow modern hearing aids to select a speech source in complex acoustic scenes with the aim of enhancing the speech intelligibility of a target speaker. In this study, we look at different performance measures to estimate the word error rates of simulated behind-the-ear hearing aid signals and detect the azimuth angle of the target source in 180-degree spatial scenes. These measures derive from phoneme posterior probabilities produced by a deep neural network acoustic model. However, the more complex the model is, the more computationally expensive it becomes to obtain these measures; therefore, we assess how the model size affects prediction performance. Our findings suggest measures derived from smaller nets are suitable to predict error rates of more complex models reliably enough to be implemented in hearing aid hardware.

BibTeX key: martinez2018dnnbased
entry type: article
year: 2018
journal: Speech Communication
pages: 44 - 56
volume: 106
issn: 0167-6393
DOI: https://doi.org/10.1016/j.specom.2018.11.006
url: http://www.sciencedirect.com/science/article/pii/S0167639317303850

Cite this publication

@article{martinez2018dnnbased,
  abstract = {In several applications of machine listening, predicting how well an automatic speech recognition system will perform before the actual decoding enables the system to adapt to unseen acoustic characteristics dynamically. Feedback about speech quality, for instance, could allow modern hearing aids to select a speech source in complex acoustic scenes with the aim of enhancing the speech intelligibility of a target speaker. In this study, we look at different performance measures to estimate the word error rates of simulated behind-the-ear hearing aid signals and detect the azimuth angle of the target source in 180-degree spatial scenes. These measures derive from phoneme posterior probabilities produced by a deep neural network acoustic model. However, the more complex the model is, the more computationally expensive it becomes to obtain these measures; therefore, we assess how the model size affects prediction performance. Our findings suggest measures derived from smaller nets are suitable to predict error rates of more complex models reliably enough to be implemented in hearing aid hardware.},
  added-at = {2022-02-23T09:36:58.000+0100},
  author = {Martinez, Angel Mario Castro and Gerlach, Lukas and Pay{\'a} Vay{\'a}, Guillermo and Hermansky, Hynek and Ooster, Jasper and Meyer, Bernd T.},
  biburl = {https://www.bibsonomy.org/bibtex/22c70d1cb65b822e43546a9b51fdcff32/guipava},
  description = {DNN-based performance measures for predicting error rates in automatic speech recognition and optimizing hearing aid parameters - ScienceDirect},
  doi = {https://doi.org/10.1016/j.specom.2018.11.006},
  interhash = {01c57009810c976edf7588c9a3553102},
  intrahash = {2c70d1cb65b822e43546a9b51fdcff32},
  issn = {0167-6393},
  journal = {Speech Communication},
  keywords = {myown},
  pages = {44 - 56},
  timestamp = {2022-02-23T09:36:58.000+0100},
  title = {DNN-based performance measures for predicting error rates in automatic speech recognition and optimizing hearing aid parameters},
  url = {http://www.sciencedirect.com/science/article/pii/S0167639317303850},
  volume = 106,
  year = 2018
}
