Author of the publication

Singing-Tacotron: Global Duration Control Attention and Dynamic Filter for End-to-end Singing Voice Synthesis.

DDAM@MM, pages 53-59. ACM, (2022)

Other publications of authors with the same name

Fully Automated End-to-End Fake Audio Detection. DDAM@MM, pages 27-33. ACM, (2022)

Half-Truth: A Partially Fake Audio Detection Dataset. Interspeech, pages 1654-1658. ISCA, (2021)

Dynamic Speaker Representations Adjustment and Decoder Factorization for Speaker Adaptation in End-to-End Speech Synthesis. INTERSPEECH, pages 4701-4705. ISCA, (2020)

System Fingerprints Detection for DeepFake Audio: An Initial Dataset and Investigation. CoRR, (2022)

UnifySpeech: A Unified Framework for Zero-shot Text-to-Speech and Voice Conversion. CoRR, (2023)

ADD 2022: the first Audio Deep Synthesis Detection Challenge. ICASSP, pages 9216-9220. IEEE, (2022)

Context-Aware Mask Prediction Network for End-to-End Text-Based Speech Editing. ICASSP, pages 6082-6086. IEEE, (2022)

Focusing on Attention: Prosody Transfer and Adaptative Optimization Strategy for Multi-Speaker End-to-End Speech Synthesis. ICASSP, pages 6709-6713. IEEE, (2020)

Dynamic Soft Windowing and Language Dependent Style Token for Code-Switching End-to-End Speech Synthesis. INTERSPEECH, pages 2937-2941. ISCA, (2020)

Spoken Content and Voice Factorization for Few-Shot Speaker Adaptation. INTERSPEECH, pages 796-800. ISCA, (2020)