Author of the publication

Towards Data Selection on TTS Data for Children's Speech Recognition.

, , , , , and . ICASSP, page 6888-6892. IEEE, (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Speaker Augmentation for Low Resource Speech Recognition., and . ICASSP, page 7719-7723. IEEE, (2020)UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding., , , , , , , , and . AAAI, page 17924-17932. AAAI Press, (2024)Rich Prosody Diversity Modelling with Phone-Level Mixture Density Network., and . Interspeech, page 3136-3140. ISCA, (2021)VQTTS: High-Fidelity Text-to-Speech Synthesis with Self-Supervised VQ Acoustic Feature., , , and . INTERSPEECH, page 1596-1600. ISCA, (2022)Towards Data Selection on TTS Data for Children's Speech Recognition., , , , , and . ICASSP, page 6888-6892. IEEE, (2021)Improving Code-Switching and Name Entity Recognition in ASR with Speech Editing based Data Augmentation., , , , , and . INTERSPEECH, page 919-923. ISCA, (2023)VALL-T: Decoder-Only Generative Transducer for Robust and Decoding-Controllable Text-to-Speech., , , , , , , , and . CoRR, (2024)Data Augmentation for End-to-end Code-switching Speech Recognition., , , , and . CoRR, (2020)Unsupervised Word-Level Prosody Tagging for Controllable Speech Synthesis., , and . ICASSP, page 7597-7601. IEEE, (2022)Multi-Speaker Multi-Lingual VQTTS System for LIMMITS 2023 Challenge., , , and . ICASSP, page 1-2. IEEE, (2023)