Author of the publication

CosyVoice: A Scalable Multilingual Zero-shot Text-to-speech Synthesizer based on Supervised Semantic Tokens.

, , , , , , , , , , , and . CoRR, (2024)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Domain Generalization Capability Enhancement for Binary Neural Networks., , and . BMVC, page 13. BMVA Press, (2022)RAM: A Region-Aware Deep Model for Vehicle Re-Identification., , , and . ICME, page 1-6. IEEE Computer Society, (2018)M2Met: The Icassp 2022 Multi-Channel Multi-Party Meeting Transcription Challenge., , , , , , , , , and 2 other author(s). ICASSP, page 6167-6171. IEEE, (2022)Simplified Self-Attention for Transformer-Based end-to-end Speech Recognition., , , and . SLT, page 75-81. IEEE, (2021)MFCCA:Multi-Frame Cross-Channel Attention for Multi-Speaker ASR in Multi-Party Meeting Scenario., , , , , , and . SLT, page 144-151. IEEE, (2022)Personalized Visual Vocabulary Adaption for Social Image Retrieval., , , and . ACM Multimedia, page 993-996. ACM, (2014)Learning attribute-aware dictionary for image classification and search., , , , and . ICMR, page 33-40. ACM, (2013)Scalable mobile search with binary phrase., , , , and . ICIMCS, page 66-70. ACM, (2013)Investigation of Transformer Based Spelling Correction Model for CTC-Based End-to-End Mandarin Speech Recognition., , and . INTERSPEECH, page 2180-2184. ISCA, (2019)FunASR: A Fundamental End-to-End Speech Recognition Toolkit., , , , , , , , , and . INTERSPEECH, page 1593-1597. ISCA, (2023)