copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

A Style Adaptation Technique for Speech Synthesis Using HSMM and Suprasegmental Features

M. Tachibana, J. Yamagishi, T. Masuko, and T. Kobayashi. IEICE Transactions on Information and Systems, E89-D (3): 1092-1099 (March 2006)
DOI: 10.1093/ietisy/e89-d.3.1092

Abstract

This paper proposes a technique for synthesizing speech with a desired speaking style and/or emotional expression, based on model adaptation in an HMM-based speech synthesis framework. Speaking styles and emotional expressions are characterized by many segmental and suprasegmental features in both spectral and prosodic features. Therefore, it is essential to take account of these features in the model adaptation. The proposed technique called style adaptation, deals with this issue. Firstly, the maximum likelihood linear regression (MLLR) algorithm, based on a framework of hidden semi-Markov model (HSMM) is presented to provide a mathematically rigorous and robust adaptation of state duration and to adapt both the spectral and prosodic features. Then, a novel tying method for the regression matrices of the MLLR algorithm is also presented to allow the incorporation of both the segmental and suprasegmental speech features into the style adaptation. The proposed tying method uses regression class trees with contextual information. From the results of several subjective tests, we show that these techniques can perform style adaptation while maintaining naturalness of the synthetic speech.

@m-toman's tags highlighted

Cite this publication

@article{Tachibana2006, abstract = {This paper proposes a technique for synthesizing speech with a desired speaking style and/or emotional expression, based on model adaptation in an HMM-based speech synthesis framework. Speaking styles and emotional expressions are characterized by many segmental and suprasegmental features in both spectral and prosodic features. Therefore, it is essential to take account of these features in the model adaptation. The proposed technique called style adaptation, deals with this issue. Firstly, the maximum likelihood linear regression (MLLR) algorithm, based on a framework of hidden semi-Markov model (HSMM) is presented to provide a mathematically rigorous and robust adaptation of state duration and to adapt both the spectral and prosodic features. Then, a novel tying method for the regression matrices of the MLLR algorithm is also presented to allow the incorporation of both the segmental and suprasegmental speech features into the style adaptation. The proposed tying method uses regression class trees with contextual information. From the results of several subjective tests, we show that these techniques can perform style adaptation while maintaining naturalness of the synthetic speech.}, added-at = {2021-02-01T10:51:23.000+0100}, author = {Tachibana, Makoto and Yamagishi, Junichi and Masuko, Takashi and Kobayashi, Takao}, biburl = {https://www.bibsonomy.org/bibtex/21a24f96ab0eaa130cb7c645d2a954597/m-toman}, doi = {10.1093/ietisy/e89-d.3.1092}, file = {:pdfs/tachibana_ieice_2006.pdf:PDF}, interhash = {aad6826f313b1941eeefcc000e5b7f3e}, intrahash = {1a24f96ab0eaa130cb7c645d2a954597}, journal = {IEICE Transactions on Information and Systems}, keywords = {(HSMM), (MLLR) HMM-based adaptation, emotional expression, hidden likelihood linear maximum model regression semi-Markov speaking speech style style, synthesis,}, month = mar, number = 3, owner = {schabus}, pages = {1092-1099}, timestamp = {2021-02-01T10:51:23.000+0100}, title = {A Style Adaptation Technique for Speech Synthesis Using HSMM and Suprasegmental Features}, volume = {E89-D}, year = 2006 }

BibSonomy

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

A Style Adaptation Technique for Speech Synthesis Using HSMM and Suprasegmental Features

Abstract

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews
(0)

BibSonomy

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML A Style Adaptation Technique for Speech Synthesis Using HSMM and Suprasegmental Features

Abstract

Links and resources

Tags

community

Cite this publication

More citation styles

search on

Meta data

Comments and Reviews (0)

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

A Style Adaptation Technique for Speech Synthesis Using HSMM and Suprasegmental Features

Comments and Reviews
(0)