Author of the publication

VQMIVC: Vector Quantization and Mutual Information-Based Unsupervised Speech Representation Disentanglement for One-Shot Voice Conversion.

Interspeech, page 1344-1348. ISCA, (2021)


Other publications of authors with the same name

Online Speaker Diarization with Core Samples Selection. INTERSPEECH, page 1466-1470. ISCA, (2022)
Structured mean field method for single-microphone speech separation with factorial Hidden Markov Model. ChinaSIP, page 122-126. IEEE, (2013)
CorrectSpeech: A Fully Automated System for Speech Correction and Accent Reduction. ISCSLP, page 81-85. IEEE, (2022)
A Time Domain Progressive Learning Approach with SNR Constriction for Single-Channel Speech Enhancement and Recognition. ICASSP, page 6277-6281. IEEE, (2022)
Streamable Speech Representation Disentanglement and Multi-Level Prosody Modeling for Live One-Shot Voice Conversion. INTERSPEECH, page 2578-2582. ISCA, (2022)
Improving End-to-End Speech Processing by Efficient Text Data Utilization with Latent Synthesis. EMNLP (Findings), page 4916-4928. Association for Computational Linguistics, (2023)
EditSpeech: A Text Based Speech Editing System Using Partial Inference and Bidirectional Fusion. CoRR, (2021)
Prosody for Mandarin speech recognition: a comparative study of read and spontaneous speech. INTERSPEECH, page 1133-1136. ISCA, (2008)
SPIRAL: Self-supervised Perturbation-Invariant Representation Learning for Speech Pre-Training. ICLR, OpenReview.net, (2022)
ToneUnit: A Speech Discretization Approach for Tonal Language Speech Synthesis. CoRR, (2024)