Author of the publication

Serialized Output Training for End-to-End Overlapped Speech Recognition.

, , , , and . INTERSPEECH, page 2797-2801. ISCA, (2020)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

A two-layer model for behavior and dialogue planning in conversational service robots., , , , , , , , and . IROS, page 3329-3335. IEEE, (2005)Multiple index combination for Japanese spoken term detection with optimum index selection based on OOV-region classifier., , and . ICASSP, page 8540-8544. IEEE, (2013)Microsoft Speaker Diarization System for the VoxCeleb Speaker Recognition Challenge 2020., , , , , , , , , and 3 other author(s). CoRR, (2020)Open-vocabulary keyword detection from super-large scale speech database., , , and . MMSP, page 939-944. IEEE Signal Processing Society, (2008)Making Flow-Matching-Based Zero-Shot Text-to-Speech Laugh as You Like., , , , , , , , , and 5 other author(s). CoRR, (2024)Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings., , , , , , and . CoRR, (2020)Leveraging Timestamp Information for Serialized Joint Streaming Recognition and Translation., , , , , , and . CoRR, (2023)Streaming Multi-Talker ASR with Token-Level Serialized Output Training., , , , , , , , , and . INTERSPEECH, page 3774-3778. ISCA, (2022)Streaming Multi-Talker Speech Recognition with Joint Speaker Identification., , , and . Interspeech, page 1782-1786. ISCA, (2021)Maximum a posteriori Based Decoding for CTC Acoustic Models., , and . INTERSPEECH, page 1868-1872. ISCA, (2016)