Author of the publication

CosyVoice: A Scalable Multilingual Zero-shot Text-to-speech Synthesizer based on Supervised Semantic Tokens.

, , , , , , , , , , , and . CoRR, (2024)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Investigation of Monaural Front-End Processing for Robust Speech Recognition Without Retraining or Joint-Training., , and . APSIPA, page 249-254. IEEE, (2019)Acoustic Scene Classification by Implicitly Identifying Distinct Sound Events., , , and . INTERSPEECH, page 3860-3864. ISCA, (2019)Double Adversarial Network Based Monaural Speech Enhancement for Robust Speech Recognition., , and . INTERSPEECH, page 309-313. ISCA, (2020)TOLD: a Novel Two-Stage Overlap-Aware Framework for Speaker Diarization., , and . ICASSP, page 1-5. IEEE, (2023)IntrinsicVoice: Empowering LLMs with Intrinsic Real-time Voice Interaction Abilities., , , , , , , , , and 3 other author(s). CoRR, (2024)Personality-memory Gated Adaptation: An Efficient Speaker Adaptation for Personalized End-to-end Automatic Speech Recognition., , , , and . INTERSPEECH, ISCA, (2024)CASA-ASR: Context-Aware Speaker-Attributed ASR., , , , , , , and . INTERSPEECH, page 411-415. ISCA, (2023)M2MeT: The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Challenge., , , , , , , , , and 2 other author(s). CoRR, (2021)Investigation of Monaural Front-End Processing for Robust ASR without Retraining or Joint-Training., , and . CoRR, (2018)A Joint Framework of Denoising Autoencoder and Generative Vocoder for Monaural Speech Enhancement., , and . IEEE ACM Trans. Audio Speech Lang. Process., (2020)