Author of the publication

ByteSing: A Chinese Singing Voice Synthesis System Using Duration Allocated Encoder-Decoder Acoustic Models and WaveRNN Vocoders.

, , , , , , , , and . ISCSLP, page 1-5. IEEE, (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

PPG-Based Singing Voice Conversion with Adversarial Representation Learning., , , , , , and . ICASSP, page 7073-7077. IEEE, (2021)Generative Adversarial Network based Speaker Adaptation for High Fidelity WaveNet Vocoder., , , , and . CoRR, (2018)Towards Realistic Visual Dubbing with Heterogeneous Sources., , , , , , , , , and . ACM Multimedia, page 1739-1747. ACM, (2021)Towards Using Clothes Style Transfer for Scenario-Aware Person Video Generation., , , , , , and . ICASSP, page 1745-1749. IEEE, (2022)TranssionADD: A Multi-frame Reinforcement Based Sequence Tagging Model for Audio Deepfake Detection., , , , , , , and . DADA@IJCAI, volume 3597 of CEUR Workshop Proceedings, page 113-118. CEUR-WS.org, (2023)ByteSing: A Chinese Singing Voice Synthesis System Using Duration Allocated Encoder-Decoder Acoustic Models and WaveRNN Vocoders., , , , , , , , and . ISCSLP, page 1-5. IEEE, (2021)ByteSing: A Chinese Singing Voice Synthesis System Using Duration Allocated Encoder-Decoder Acoustic Models and WaveRNN Vocoders., , , , , , , , and . CoRR, (2020)CPNet: Exploiting CLIP-based Attention Condenser and Probability Map Guidance for High-fidelity Talking Face Generation., , , , and . ICME, page 240-245. IEEE, (2023)Improving Accent Conversion with Reference Encoder and End-To-End Text-To-Speech., , , , , , , , and . CoRR, (2020)Application of pronunciation knowledge on phoneme recognition by LSTM neural network., , , and . ICPR, page 2906-2911. IEEE, (2016)