Author of the publication

Spectro-Temporal Modelling with Time-Frequency LSTM and Structured Output Layer for Voice Conversion.

, , , , , and . INTERSPEECH, page 3409-3413. ISCA, (2017)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Transformer-S2A: Robust and Efficient Speech-to-Animation., , , , , and . ICASSP, page 7247-7251. IEEE, (2022)VAST: Vivify Your Talking Avatar via Zero-Shot Expressive Facial Style Transfer., , , , , , and . ICCV (Workshops), page 2969-2979. IEEE, (2023)Learning Contextual Representation with Convolution Bank and Multi-head Self-attention for Speech Emphasis Detection., , , , and . APSIPA, page 922-926. IEEE, (2019)Dilated Residual Network with Multi-head Self-attention for Speech Emotion Recognition., , , , and . ICASSP, page 6675-6679. IEEE, (2019)Multi-task learning of structured output layer bidirectional LSTMS for speech synthesis., , , , and . ICASSP, page 5510-5514. IEEE, (2017)Multi-Task Deep Learning for User Intention Understanding in Speech Interaction Systems., , , , , , and . AAAI, page 161-167. AAAI Press, (2017)Multi-modal Multi-scale Speech Expression Evaluation in Computer-Assisted Language Learning., , , , , and . AIMS, volume 10970 of Lecture Notes in Computer Science, page 16-28. Springer, (2018)Spectro-Temporal Modelling with Time-Frequency LSTM and Structured Output Layer for Voice Conversion., , , , , and . INTERSPEECH, page 3409-3413. ISCA, (2017)Enhancing Monotonicity for Robust Autoregressive Transformer TTS., , , , , and . INTERSPEECH, page 3181-3185. ISCA, (2020)Siamese Recurrent Auto-Encoder Representation for Query-by-Example Spoken Term Detection., , , , and . INTERSPEECH, page 102-106. ISCA, (2018)