Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Serialized Output Training for End-to-End Overlapped Speech Recognition.

N. Kanda, Y. Gaur, X. Wang, Z. Meng, and T. Yoshioka. INTERSPEECH, page 2797-2801. ISCA, (2020)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Akira Kanda

Jorge Kanda

Kanda Romruen

Mahesh Kandasamy

Admire M Kandawasvika

Other publications of authors with the same name

A two-layer model for behavior and dialogue planning in conversational service robots.M. Nakano, Y. Hasegawa, K. Nakadai, T. Nakamura, J. Takeuchi, T. Torii, H. Tsujino, N. Kanda, and H. Okuno. IROS, page 3329-3335. IEEE, (2005)Multiple index combination for Japanese spoken term detection with optimum index selection based on OOV-region classifier.N. Kanda, K. Itoyama, and H. Okuno. ICASSP, page 8540-8544. IEEE, (2013)Microsoft Speaker Diarization System for the VoxCeleb Speaker Recognition Challenge 2020.X. Xiao, N. Kanda, Z. Chen, T. Zhou, T. Yoshioka, S. Chen, Y. Zhao, G. Liu, Y. Wu, J. Wu and 3 other author(s). CoRR, (2020)Open-vocabulary keyword detection from super-large scale speech database.N. Kanda, H. Sagawa, T. Sumiyoshi, and Y. Obuchi. MMSP, page 939-944. IEEE Signal Processing Society, (2008)Making Flow-Matching-Based Zero-Shot Text-to-Speech Laugh as You Like.N. Kanda, X. Wang, S. Eskimez, M. Thakker, H. Yang, Z. Zhu, M. Tang, C. Li, C. Tsai, Z. Xiao and 5 other author(s). CoRR, (2024)Investigation of End-To-End Speaker-Attributed ASR for Continuous Multi-Talker Recordings.N. Kanda, X. Chang, Y. Gaur, X. Wang, Z. Meng, Z. Chen, and T. Yoshioka. CoRR, (2020)Leveraging Timestamp Information for Serialized Joint Streaming Recognition and Translation.S. Papi, P. Wang, J. Chen, J. Xue, N. Kanda, J. Li, and Y. Gaur. CoRR, (2023)Streaming Multi-Talker ASR with Token-Level Serialized Output Training.N. Kanda, J. Wu, Y. Wu, X. Xiao, Z. Meng, X. Wang, Y. Gaur, Z. Chen, J. Li, and T. Yoshioka. INTERSPEECH, page 3774-3778. ISCA, (2022)Streaming Multi-Talker Speech Recognition with Joint Speaker Identification.L. Lu, N. Kanda, J. Li, and Y. Gong. Interspeech, page 1782-1786. ISCA, (2021)Maximum a posteriori Based Decoding for CTC Acoustic Models.N. Kanda, X. Lu, and H. Kawai. INTERSPEECH, page 1868-1872. ISCA, (2016)

BibSonomy

Disambiguation of "Kanda, Naoyuki"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Serialized Output Training for End-to-End Overlapped Speech Recognition.

Please choose a person to relate this publication to

Akira Kanda

Jorge Kanda

Kanda Romruen

Mahesh Kandasamy

Admire M Kandawasvika

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Kanda, Naoyuki"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Serialized Output Training for End-to-End Overlapped Speech Recognition.

Please choose a person to relate this publication to

Akira Kanda

Jorge Kanda

Kanda Romruen

Mahesh Kandasamy

Admire M Kandawasvika

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Serialized Output Training for End-to-End Overlapped Speech Recognition.