Inproceedings,

HMM-Based Hierarchical Unit Selection Combining Kullback-Leibler Divergence with Likelihood Criterion

Z. Ling, and R. Wang.
Proceedings of the 2007 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 4, page 1245-1248. Honolulu, HI, USA, (April 2007)
DOI: 10.1109/ICASSP.2007.367302

Abstract

This paper presents a hidden Markov model (HMM) based unit selection method using hierarchical units under statistical criterion. In our previous work we tried to use frame sized speech segments and maximum likelihood criterion to improve the performance of traditional concatenative synthesis system using phone sized units and cost function criterion. In this paper, hierarchical units which consist of phone level units and frame level units are adopted to achieve better balance between the coverage rate of candidate unit and the number of concatenation points during synthesis. Besides, Kullback-Leibler divergence (KLD) between candidate and target phoneme HMMs is introduced as a part of the final criterion for unit selection. The listening result proves that these two approaches can improve the performance of synthetic speech effectively.

BibTeX key: Ling2007
entry type: inproceedings
address: Honolulu, HI, USA
booktitle: Proceedings of the 2007 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
year: 2007
month: apr
pages: 1245-1248
volume: 4
owner: schabus
file: :pdfs/ling_icassp_2007.pdf:PDF
issn: 1520-6149
DOI: 10.1109/ICASSP.2007.367302

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

@inproceedings{Ling2007, abstract = {This paper presents a hidden Markov model (HMM) based unit selection method using hierarchical units under statistical criterion. In our previous work we tried to use frame sized speech segments and maximum likelihood criterion to improve the performance of traditional concatenative synthesis system using phone sized units and cost function criterion. In this paper, hierarchical units which consist of phone level units and frame level units are adopted to achieve better balance between the coverage rate of candidate unit and the number of concatenation points during synthesis. Besides, Kullback-Leibler divergence (KLD) between candidate and target phoneme HMMs is introduced as a part of the final criterion for unit selection. The listening result proves that these two approaches can improve the performance of synthetic speech effectively.}, added-at = {2021-02-01T10:51:23.000+0100}, address = {Honolulu, HI, USA}, author = {Ling, Zhen-Hua and Wang, Ren-Hua}, biburl = {https://www.bibsonomy.org/bibtex/20eee51bbc5f24256470ec721c847db71/m-toman}, booktitle = {Proceedings of the 2007 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)}, doi = {10.1109/ICASSP.2007.367302}, file = {:pdfs/ling_icassp_2007.pdf:PDF}, interhash = {7b3cfc55dd988d56619ce70518702be8}, intrahash = {0eee51bbc5f24256470ec721c847db71}, issn = {1520-6149}, keywords = {Markov Synthesis criterion;Context criterion;frame criterion;maximum criterion;phone divergence;concatenative estimation;speech function function;Databases;Diversity hidden hierarchical level likelihood model;likelihood modeling;Cost models;Signal models;maximum programming;Flowcharts;Hidden reception;Dynamic segments;hidden selection;Kullback-Leibler sized speech synthesis synthesis;HMM-based synthesis;Speech synthesis;Synthesizers;HMM;KLD;Speech system;cost unit units;frame units;statistical}, month = apr, owner = {schabus}, pages = {1245-1248}, timestamp = {2021-02-01T10:51:23.000+0100}, title = {HMM-Based Hierarchical Unit Selection Combining Kullback-Leibler Divergence with Likelihood Criterion}, volume = 4, year = 2007 }

BibSonomy

HMM-Based Hierarchical Unit Selection Combining Kullback-Leibler Divergence with Likelihood Criterion

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on