Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Target Confusion in End-to-end Speaker Extraction: Analysis and Approaches.

Z. Zhao, D. Yang, R. Gu, H. Zhang, and Y. Zou. INTERSPEECH, page 5333-5337. ISCA, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Dongchao Hou

Yang Yang

Other publications of authors with the same name

Omnidirectional Motion Control Method of Quadruped Robot Based on 3D-CPG Oscillator Group.B. Tao, D. Yang, G. Huang, Z. Zeng, C. Chen, and T. Li. CLAWAR, volume 530 of Lecture Notes in Networks and Systems, page 301-312. Springer, (2022)InstructTTS: Modelling Expressive TTS in Discrete Latent Space with Natural Language Style Prompt.D. Yang, S. Liu, R. Huang, G. Lei, C. Weng, H. Meng, and D. Yu. CoRR, (2023)DPM-TSE: A Diffusion Probabilistic Model for Target Sound Extraction.J. Hai, H. Wang, D. Yang, K. Thakkar, N. Dehak, and M. Elhilali. CoRR, (2023)RaDur: A Reference-aware and Duration-robust Network for Target Sound Detection.D. Yang, H. Wang, Z. Ye, Y. Zou, and W. Wang. INTERSPEECH, page 1511-1515. ISCA, (2022)Improving Target Sound Extraction with Timestamp Information.H. Wang, D. Yang, C. Weng, J. Yu, and Y. Zou. INTERSPEECH, page 1526-1530. ISCA, (2022)PromptTTS 2: Describing and Generating Voices with Text Prompt.Y. Leng, Z. Guo, K. Shen, X. Tan, Z. Ju, Y. Liu, Y. Liu, D. Yang, L. Zhang, K. Song and 5 other author(s). CoRR, (2023)InstructSpeech: Following Speech Editing Instructions via Large Language Models.R. Huang, R. Hu, Y. Wang, Z. Wang, X. Cheng, Z. Jiang, Z. Ye, D. Yang, L. Liu, P. Gao and 1 other author(s). ICML, OpenReview.net, (2024)Diffsound: Discrete Diffusion Model for Text-to-sound Generation.D. Yang, J. Yu, H. Wang, W. Wang, C. Weng, Y. Zou, and D. Yu. CoRR, (2022)Improving the Performance of Automated Audio Captioning via Integrating the Acoustic and Semantic Information.Z. Ye, H. Wang, D. Yang, and Y. Zou. DCASE, page 40-44. (2021)YOLOv3 with Asymmetric Intersection over Union Based Loss Function for Human Detection.H. Zhu, D. Yang, G. Huang, Q. Wu, T. Li, and B. Tao. ICMLSC, page 70-76. ACM, (2021)

BibSonomy

Disambiguation of "Yang, Dongchao"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Target Confusion in End-to-end Speaker Extraction: Analysis and Approaches.

Please choose a person to relate this publication to

Dongchao Hou

Dongchao Hou

Yang Yang

Yang Yang

Yang Yang

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Yang, Dongchao"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Target Confusion in End-to-end Speaker Extraction: Analysis and Approaches.

Please choose a person to relate this publication to

Dongchao Hou

Dongchao Hou

Yang Yang

Yang Yang

Yang Yang

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Target Confusion in End-to-end Speaker Extraction: Analysis and Approaches.