Author of the publication

CosyVoice: A Scalable Multilingual Zero-shot Text-to-speech Synthesizer based on Supervised Semantic Tokens.

Z. Du, Q. Chen, S. Zhang, K. Hu, H. Lu, Y. Yang, H. Hu, S. Zheng, Y. Gu, Z. Ma, Z. Gao, and Z. Yan. CoRR, (2024)

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to the name to display some publications already assigned to the person.

Shiliang Li

Zhang Zhang

Meng Zhang

Methods and implementations of road-network matching. M. Zhang. TU München, (2009)

Shuying Zhang

Fuqing Zhang

Other publications of authors with the same name

Domain Generalization Capability Enhancement for Binary Neural Networks. J. Ye, S. Mao, and S. Zhang. BMVC, page 13. BMVA Press, (2022)

RAM: A Region-Aware Deep Model for Vehicle Re-Identification. X. Liu, S. Zhang, Q. Huang, and W. Gao. ICME, page 1-6. IEEE Computer Society, (2018)

M2Met: The ICASSP 2022 Multi-Channel Multi-Party Meeting Transcription Challenge. F. Yu, S. Zhang, Y. Fu, L. Xie, S. Zheng, Z. Du, W. Huang, P. Guo, Z. Yan, B. Ma and 2 other author(s). ICASSP, page 6167-6171. IEEE, (2022)

Simplified Self-Attention for Transformer-Based end-to-end Speech Recognition. H. Luo, S. Zhang, M. Lei, and L. Xie. SLT, page 75-81. IEEE, (2021)

MFCCA: Multi-Frame Cross-Channel Attention for Multi-Speaker ASR in Multi-Party Meeting Scenario. F. Yu, S. Zhang, P. Guo, Y. Liang, Z. Du, Y. Lin, and L. Xie. SLT, page 144-151. IEEE, (2022)

Personalized Visual Vocabulary Adaption for Social Image Retrieval. Z. Niu, S. Zhang, X. Gao, and Q. Tian. ACM Multimedia, page 993-996. ACM, (2014)

Learning attribute-aware dictionary for image classification and search. J. Cai, Z. Zha, H. Luan, S. Zhang, and Q. Tian. ICMR, page 33-40. ACM, (2013)

Scalable mobile search with binary phrase. Q. Luo, S. Zhang, T. Huang, W. Gao, and Q. Tian. ICIMCS, page 66-70. ACM, (2013)

Investigation of Transformer Based Spelling Correction Model for CTC-Based End-to-End Mandarin Speech Recognition. S. Zhang, M. Lei, and Z. Yan. INTERSPEECH, page 2180-2184. ISCA, (2019)

FunASR: A Fundamental End-to-End Speech Recognition Toolkit. Z. Gao, Z. Li, J. Wang, H. Luo, X. Shi, M. Chen, Y. Li, L. Zuo, Z. Du, and S. Zhang. INTERSPEECH, page 1593-1597. ISCA, (2023)

BibSonomy

Disambiguation of "Zhang, Shiliang"
