Author of the publication

Singing-Tacotron: Global Duration Control Attention and Dynamic Filter for End-to-end Singing Voice Synthesis.

, , , , and . DDAM@MM, page 53-59. ACM, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

System Fingerprints Detection for DeepFake Audio: An Initial Dataset and Investigation., , , , , , and . CoRR, (2022)UnifySpeech: A Unified Framework for Zero-shot Text-to-Speech and Voice Conversion., , , , , and . CoRR, (2023)Spatial and spectral deep attention fusion for multi-channel speech separation using deep embedding features., , , , and . CoRR, (2020)ADD 2022: the first Audio Deep Synthesis Detection Challenge., , , , , , , , , and 7 other author(s). ICASSP, page 9216-9220. IEEE, (2022)Context-Aware Mask Prediction Network for End-to-End Text-Based Speech Editing., , , , , and . ICASSP, page 6082-6086. IEEE, (2022)A Robust Deep Audio Splicing Detection Method via Singularity Detection Feature., , , , , , , and . ICASSP, page 2919-2923. IEEE, (2022)Distilling Knowledge for Distant Speech Recognition via Parallel Data., and . APSIPA, page 170-175. IEEE, (2019)Voice Activity Detection Based on Time-Delay Neural Networks., , , , and . APSIPA, page 1173-1178. IEEE, (2019)Batch Normalization based Unsupervised Speaker Adaptation for Acoustic Models., and . APSIPA, page 176-180. IEEE, (2019)Language-invariant Bottleneck Features from Adversarial End-to-end Acoustic Models for Low Resource Speech Recognition., , and . ICASSP, page 6071-6075. IEEE, (2019)