Author of the publication

Investigating Content-Aware Neural Text-to-Speech MOS Prediction Using Prosodic and Linguistic Features.

, , , , , , and . ICASSP, page 1-5. IEEE, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Vibrato Learning in Multi-Singer Singing Voice Synthesis., , , , and . ASRU, page 773-779. IEEE, (2021)Cross-Lingual Low Resource Speaker Adaptation Using Phonological Features., , , , , , , and . Interspeech, page 1594-1598. ISCA, (2021)Controllable speech synthesis by learning discrete phoneme-level prosodic representations., , , , , , and . Speech Commun., (2023)Factored Maximum Penalized Likelihood Kernel Regression for HMM-Based Style-Adaptive Speech Synthesis., , and . J. Sel. Topics Signal Processing, 8 (2): 251-261 (2014)Word-Level Style Control for Expressive, Non-attentive Speech Synthesis., , , , and . SPECOM, volume 12997 of Lecture Notes in Computer Science, page 336-347. Springer, (2021)Investigating Content-Aware Neural Text-to-Speech MOS Prediction Using Prosodic and Linguistic Features., , , , , , and . ICASSP, page 1-5. IEEE, (2023)Factored MLLR Adaptation Algorithm for HMM-based Expressive TTS., , , and . INTERSPEECH, page 975-978. ISCA, (2012)Fine-grained Noise Control for Multispeaker Speech Synthesis., , , , , , , , , and . INTERSPEECH, page 828-832. ISCA, (2022)SOMOS: The Samsung Open MOS Dataset for the Evaluation of Neural Text-to-Speech Synthesis., , , , , , , , and . INTERSPEECH, page 2388-2392. ISCA, (2022)Artificial stereo data generation for speech feature mapping., , , , and . ICASSP, page 4897-4900. IEEE, (2012)