Restructuring speech representations using a pitch-adaptive time–frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
A set of simple new procedures has been developed to enable the real-time manipulation of speech parameters. The proposed method uses pitch-adaptive spectral analysis combined with a surface reconstruction method in the time–frequency region. The method also consists of a fundamental frequency (F0) extraction using instantaneous frequency calculation based on a new concept called `fundamentalness'. The proposed procedures preserve the details of time–frequency surfaces while almost perfectly removing fine structures due to signal periodicity. This close-to-perfect elimination of interferences and smooth F0 trajectory allow for over 600\% mani…(more)
Please log in to take part in the discussion (add own reviews or comments).
Cite this publication
More citation styles
- please select -
%0 Journal Article
%1 Kawahara1999
%A Kawahara, Hideki
%A Masuda-Katsuse, Ikuyo
%A de Cheveigné, Alain
%D 1999
%J Speech Communication
%K Speech analysis
%N 3–4
%P 187-207
%R 10.1016/S0167-6393(98)00085-5
%T Restructuring speech representations using a pitch-adaptive time–frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds
%V 27
%X A set of simple new procedures has been developed to enable the real-time manipulation of speech parameters. The proposed method uses pitch-adaptive spectral analysis combined with a surface reconstruction method in the time–frequency region. The method also consists of a fundamental frequency (F0) extraction using instantaneous frequency calculation based on a new concept called `fundamentalness'. The proposed procedures preserve the details of time–frequency surfaces while almost perfectly removing fine structures due to signal periodicity. This close-to-perfect elimination of interferences and smooth F0 trajectory allow for over 600\% manipulation of such speech parameters as pitch, vocal tract length, and speaking rate, while maintaining high reproductive quality.
@article{Kawahara1999,
abstract = {A set of simple new procedures has been developed to enable the real-time manipulation of speech parameters. The proposed method uses pitch-adaptive spectral analysis combined with a surface reconstruction method in the time–frequency region. The method also consists of a fundamental frequency (F0) extraction using instantaneous frequency calculation based on a new concept called `fundamentalness'. The proposed procedures preserve the details of time–frequency surfaces while almost perfectly removing fine structures due to signal periodicity. This close-to-perfect elimination of interferences and smooth F0 trajectory allow for over 600\%{} manipulation of such speech parameters as pitch, vocal tract length, and speaking rate, while maintaining high reproductive quality.},
added-at = {2021-02-01T10:51:23.000+0100},
author = {Kawahara, Hideki and Masuda-Katsuse, Ikuyo and de Cheveigné, Alain},
biburl = {https://www.bibsonomy.org/bibtex/2e08b10ea1f6f26be68456c3d2e04499b/m-toman},
doi = {10.1016/S0167-6393(98)00085-5},
file = {:pdfs/kawahara_specom_1999.pdf:PDF},
interhash = {8b63f74a6cb185be657f5823ccf55c91},
intrahash = {e08b10ea1f6f26be68456c3d2e04499b},
issn = {0167-6393},
journal = {Speech Communication},
keywords = {Speech analysis},
number = {3–4},
owner = {schabus},
pages = {187-207},
timestamp = {2021-02-01T10:51:23.000+0100},
title = {Restructuring speech representations using a pitch-adaptive time–frequency smoothing and an instantaneous-frequency-based F0 extraction: Possible role of a repetitive structure in sounds},
volume = 27,
year = 1999
}