Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Self-Supervised Learning with Bi-Label Masked Speech Prediction for Streaming Multi-Talker Speech Recognition.

Z. Huang, Z. Chen, N. Kanda, J. Wu, Y. Wang, J. Li, T. Yoshioka, X. Wang, and P. Wang. ICASSP, page 1-5. IEEE, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Jinyu Kang

Jinyu Zhao

Li Li

Other publications of authors with the same name

Survey and evaluation of monocular visual-inertial SLAM algorithms for augmented reality.J. Li, B. Yang, D. Chen, N. Wang, G. Zhang, and H. Bao. Virtual Real. Intell. Hardw., 1 (4): 386-410 (2019)Building High-Accuracy Multilingual ASR With Gated Language Experts and Curriculum Training.E. Sun, J. Li, Y. Hu, Y. Zhu, L. Zhou, J. Xue, P. Wang, L. Liu, S. Liu, E. Lin and 1 other author(s). ASRU, page 1-7. IEEE, (2023)Continuous Speech Separation with Conformer.S. Chen, Y. Wu, Z. Chen, J. Wu, J. Li, T. Yoshioka, C. Wang, S. Liu, and M. Zhou. ICASSP, page 5749-5753. IEEE, (2021)LongFNT: Long-Form Speech Recognition with Factorized Neural Transducer.X. Gong, Y. Wu, J. Li, S. Liu, R. Zhao, X. Chen, and Y. Qian. ICASSP, page 1-5. IEEE, (2023)Fast and Accurate Factorized Neural Transducer for Text Adaption of End-to-End Speech Recognition Models.R. Zhao, J. Xue, P. Parthasarathy, V. Miljanic, and J. Li. ICASSP, page 1-5. IEEE, (2023)Endpoint Detection for Streaming End-to-End Multi-Talker ASR.L. Lu, J. Li, and Y. Gong. ICASSP, page 7312-7316. IEEE, (2022)Improving Self-Supervised Learning for Speech Recognition with Intermediate Layer Supervision.C. Wang, Y. Wu, S. Chen, S. Liu, J. Li, Y. Qian, and Z. Yang. ICASSP, page 7092-7096. IEEE, (2022)Listen, Look and Deliberate: Visual context-aware speech recognition using pre-trained text-video representations.S. Ghorbani, Y. Gaur, Y. Shi, and J. Li. CoRR, (2020)Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers.C. Wang, S. Chen, Y. Wu, Z. Zhang, L. Zhou, S. Liu, Z. Chen, Y. Liu, H. Wang, J. Li and 3 other author(s). CoRR, (2023)Enhanced Edge-Perceptual Guided Image Filtering.J. Li. CoRR, (2023)

BibSonomy

Disambiguation of "Li, Jinyu"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Self-Supervised Learning with Bi-Label Masked Speech Prediction for Streaming Multi-Talker Speech Recognition.

Please choose a person to relate this publication to

Jinyu Kang

Jinyu Zhao

Li Li

Li Li

Li Li

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Li, Jinyu"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Self-Supervised Learning with Bi-Label Masked Speech Prediction for Streaming Multi-Talker Speech Recognition.

Please choose a person to relate this publication to

Jinyu Kang

Jinyu Zhao

Li Li

Li Li

Li Li

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Self-Supervised Learning with Bi-Label Masked Speech Prediction for Streaming Multi-Talker Speech Recognition.