Author of the publication

Estimating Mutual Information in Prosody Representation for Emotional Prosody Transfer in Speech Synthesis.

ISCSLP, page 1-5. IEEE, (2021)


Other publications of authors with the same name

Fine-grained style modelling and transfer in text-to-speech synthesis via content-style disentanglement. CoRR, (2020)

Unsupervised Spoken Term Discovery Based on Re-clustering of Hypothesized Speech Segments with Siamese and Triplet Networks. CoRR, (2020)

Enhancing Segment-Based Speech Emotion Recognition by Deep Self-Learning. CoRR, (2021)

Acoustical Analysis of Speech Under Physical Stress in Relation to Physical Activities and Physical Literacy. CoRR, (2021)

A study on the efficacy of model pre-training in developing neural text-to-speech system. CoRR, (2021)

iEmoTTS: Toward Robust Cross-Speaker Emotion Transfer and Control for Speech Synthesis Based on Disentanglement Between Prosody and Timbre. IEEE ACM Trans. Audio Speech Lang. Process., (2023)

Noise-robust automatic speech recognition using mainlobe-resilient time-frequency quantile-based noise estimation. ISCAS (3), page 425-428. IEEE, (2004)

Isolated word recognition using modular recurrent neural networks. Pattern Recognit., 31 (6): 751-760 (1998)

Modeling Cantonese Pronunciation Variations for Large-Vocabulary Continuous Speech Recognition. IJCLCLP, (2006)

Static and Dynamic Spectral Features: Their Noise Robustness and Optimal Weights for ASR. IEEE Trans. Speech Audio Process., 15 (3): 1087-1097 (2007)