Author of the publication

Xiaoicesing 2: A High-Fidelity Singing Voice Synthesizer Based on Generative Adversarial Network.

, , and . INTERSPEECH, page 5401-5405. ISCA, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

HAM-TTS: Hierarchical Acoustic Modeling for Token-Based Zero-Shot Text-to-Speech with Model and Data Scaling., , , , , , , , and . CoRR, (2024)Deep Spectro-temporal Artifacts for Detecting Synthesized Speech., , , , , , , , , and . DDAM@MM, page 69-75. ACM, (2022)Joint Speaker Encoder and Neural Back-end Model for Fully End-to-End Automatic Speaker Verification with Multiple Enrollment Utterances., , , , and . CoRR, (2022)Data Augmentation Using McAdams-Coefficient-Based Speaker Anonymization for Fake Audio Detection., , , , , , , , , and . INTERSPEECH, page 664-668. ISCA, (2022)Spoofing-Aware Attention based ASV Back-end with Multiple Enrollment Utterances and a Sampling Strategy for the SASV Challenge 2022., , , and . INTERSPEECH, page 2883-2887. ISCA, (2022)Xiaoicesing 2: A High-Fidelity Singing Voice Synthesizer Based on Generative Adversarial Network., , and . INTERSPEECH, page 5401-5405. ISCA, (2023)Improving Generalization Ability of Countermeasures for New Mismatch Scenario by Combining Multiple Advanced Regularization Terms., , , , and . INTERSPEECH, page 1998-2002. ISCA, (2023)Joint speaker encoder and neural back-end model for fully end-to-end automatic speaker verification with multiple enrollment utterances., , , , and . Comput. Speech Lang., (2024)A Benchmark for Multi-speaker Anonymization., , , and . CoRR, (2024)Crosssinger: A Cross-Lingual Multi-Singer High-Fidelity Singing Voice Synthesizer Trained on Monolingual Singers., , , and . ASRU, page 1-6. IEEE, (2023)