Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Diffsound: Discrete Diffusion Model for Text-to-sound Generation.

D. Yang, J. Yu, H. Wang, W. Wang, C. Weng, Y. Zou, and D. Yu. CoRR, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Chao Weng

Hsi-Lin Chao

Kuo-Yih Chao

Zouheir Chaoui

Hanna Huey-Jiun Chao

Other publications of authors with the same name

Discriminative Training Using Non-Uniform Criteria for Keyword Spotting on Spontaneous Speech.C. Weng, and B. Juang. IEEE ACM Trans. Audio Speech Lang. Process., 23 (2): 300-312 (2015)Diffsound: Discrete Diffusion Model for Text-to-sound Generation.D. Yang, J. Yu, H. Wang, W. Wang, C. Weng, Y. Zou, and D. Yu. CoRR, (2022)Multi-Channel Speaker Diarization Using Spatial Features for Meetings.N. Zheng, N. Li, J. Yu, C. Weng, D. Su, X. Liu, and H. Meng. ICASSP, page 7337-7341. IEEE, (2022)Enhancing Speaking Styles in Conversational Text-to-Speech Synthesis with Graph-Based Multi-Modal Context Modeling.J. Li, Y. Meng, C. Li, Z. Wu, H. Meng, C. Weng, and D. Su. ICASSP, page 7917-7921. IEEE, (2022)Far-Field Location Guided Target Speech Extraction Using End-to-End Speech Recognition Objectives.A. Subramanian, C. Weng, M. Yu, S. Zhang, Y. Xu, S. Watanabe, and D. Yu. ICASSP, page 7299-7303. IEEE, (2020)High Fidelity Speech Enhancement with Band-split RNN.J. Yu, H. Chen, Y. Luo, R. Gu, and C. Weng. INTERSPEECH, page 2483-2487. ISCA, (2023)Raw Waveform Encoder with Multi-Scale Globally Attentive Locally Recurrent Networks for End-to-End Speech Recognition.M. Lam, J. Wang, C. Weng, D. Su, and D. Yu. Interspeech, page 316-320. ISCA, (2021)Bayes Risk CTC: Controllable CTC Alignment in Sequence-to-Sequence Tasks.J. Tian, B. Yan, J. Yu, C. Weng, D. Yu, and S. Watanabe. ICLR, OpenReview.net, (2023)Towards end-to-end Speaker Diarization with Generalized Neural Speaker Clustering.C. Zhang, J. Shi, C. Weng, M. Yu, and D. Yu. ICASSP, page 8372-8376. IEEE, (2022)Non-Autoregressive Transformer ASR with CTC-Enhanced Decoder Input.X. Song, Z. Wu, Y. Huang, C. Weng, D. Su, and H. Meng. ICASSP, page 5894-5898. IEEE, (2021)

BibSonomy

Disambiguation of "Weng, Chao"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Diffsound: Discrete Diffusion Model for Text-to-sound Generation.

Please choose a person to relate this publication to

Chao Weng

Hsi-Lin Chao

Kuo-Yih Chao

Zouheir Chaoui

Hanna Huey-Jiun Chao

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Weng, Chao"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Diffsound: Discrete Diffusion Model for Text-to-sound Generation.

Please choose a person to relate this publication to

Chao Weng

Hsi-Lin Chao

Kuo-Yih Chao

Zouheir Chaoui

Hanna Huey-Jiun Chao

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Diffsound: Discrete Diffusion Model for Text-to-sound Generation.