Author of the publication

SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative Capabilities.

ACL (1), page 8479-8492. Association for Computational Linguistics, (2022)


Other publications of authors with the same name

ESPnet-SPK: full pipeline speaker embedding toolkit with reproducible recipes, self-supervised front-ends, and off-the-shelf models. CoRR, (2024)
Recent Developments on ESPnet Toolkit Boosted by Conformer. CoRR, (2020)
Exploration on HuBERT with Multiple Resolutions. CoRR, (2023)
Reproducing Whisper-Style Training Using An Open-Source Toolkit And Publicly Available Data. ASRU, page 1-8. IEEE, (2023)
Context-Aware Goodness of Pronunciation for Computer-Assisted Pronunciation Training. INTERSPEECH, page 3057-3061. ISCA, (2020)
Blockwise Streaming Transformer for Spoken Language Understanding and Simultaneous Speech Translation. INTERSPEECH, page 1746-1750. ISCA, (2022)
SingAug: Data Augmentation for Singing Voice Synthesis with Cycle-consistent Training Strategy. INTERSPEECH, page 4272-4276. ISCA, (2022)
Improving Massively Multilingual ASR with Auxiliary CTC Objectives. ICASSP, page 1-5. IEEE, (2023)
Towards end-to-end Speaker Diarization with Generalized Neural Speaker Clustering. ICASSP, page 8372-8376. IEEE, (2022)
CMU's IWSLT 2023 Simultaneous Speech Translation System. IWSLT@ACL, page 235-240. Association for Computational Linguistics, (2023)