Author of the publication

Token-wise Training for Attention Based End-to-end Speech Recognition.

, , , and . ICASSP, page 6276-6280. IEEE, (2019)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Raw Waveform Encoder with Multi-Scale Globally Attentive Locally Recurrent Networks for End-to-End Speech Recognition., , , , and . Interspeech, page 316-320. ISCA, (2021)Enhancing Speaking Styles in Conversational Text-to-Speech Synthesis with Graph-Based Multi-Modal Context Modeling., , , , , , and . ICASSP, page 7917-7921. IEEE, (2022)Multi-Channel Speaker Diarization Using Spatial Features for Meetings., , , , , , and . ICASSP, page 7337-7341. IEEE, (2022)Discriminative Training Using Non-Uniform Criteria for Keyword Spotting on Spontaneous Speech., and . IEEE ACM Trans. Audio Speech Lang. Process., 23 (2): 300-312 (2015)Joint Modeling of Code-Switched and Monolingual ASR via Conditional Factorization., , , , , , , , and . CoRR, (2021)VARA-TTS: Non-Autoregressive Text-to-Speech Synthesis based on Very Deep VAE with Residual Attention., , , , , , and . CoRR, (2021)Diffsound: Discrete Diffusion Model for Text-to-sound Generation., , , , , , and . CoRR, (2022)Detect what you want: Target Sound Detection., , , and . CoRR, (2021)Improving Mandarin End-to-End Speech Recognition with Word N-gram Language Model., , , , and . CoRR, (2022)An investigation of neural uncertainty estimation for target speaker extraction equipped RNN transducer., , , , , and . Comput. Speech Lang., (2022)