Author of the publication

Self-Supervised Learning with Bi-Label Masked Speech Prediction for Streaming Multi-Talker Speech Recognition.

ICASSP, page 1-5. IEEE, (2023)


Other publications of authors with the same name

Survey and evaluation of monocular visual-inertial SLAM algorithms for augmented reality. Virtual Real. Intell. Hardw., 1 (4): 386-410 (2019)

Building High-Accuracy Multilingual ASR With Gated Language Experts and Curriculum Training. ASRU, page 1-7. IEEE, (2023)

Continuous Speech Separation with Conformer. ICASSP, page 5749-5753. IEEE, (2021)

LongFNT: Long-Form Speech Recognition with Factorized Neural Transducer. ICASSP, page 1-5. IEEE, (2023)

Fast and Accurate Factorized Neural Transducer for Text Adaption of End-to-End Speech Recognition Models. ICASSP, page 1-5. IEEE, (2023)

Endpoint Detection for Streaming End-to-End Multi-Talker ASR. ICASSP, page 7312-7316. IEEE, (2022)

Improving Self-Supervised Learning for Speech Recognition with Intermediate Layer Supervision. ICASSP, page 7092-7096. IEEE, (2022)

Listen, Look and Deliberate: Visual context-aware speech recognition using pre-trained text-video representations. CoRR, (2020)

Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers. CoRR, (2023)

Enhanced Edge-Perceptual Guided Image Filtering. CoRR, (2023)