From post

SpeechUT: Bridging Speech and Text with Hidden-Unit for Encoder-Decoder Based Speech-Text Pre-training.

, , , , , , и . EMNLP, стр. 1663-1676. Association for Computational Linguistics, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

A consensus-based method for group decision making with incomplete uncertain linguistic preference relations., , и . Soft Comput., 23 (2): 669-682 (2019)Classification of MMG Signal Based on EMD., , , , , , и . LSMS/ICSEE (1), том 761 из Communications in Computer and Information Science, стр. 23-34. Springer, (2017)DOA-TDOA Hybrid Location Based on Weighted Least Squares with Observation Station Position Errors., , , и . ICCT, стр. 1777-1783. IEEE, (2022)Learning Contextually Fused Audio-Visual Representations For Audio-Visual Speech Recognition., , , , , и . ICIP, стр. 1346-1350. IEEE, (2022)A Two-Stage Method for Short-Wave Target Localization Using DOA and TDOA Measurements., , , , и . IEEE Access, (2023)Speed Synchronous Control of Dual-BLDCMs based on MRAC for a Hoist Application., , , , , и . IECON, стр. 1-6. IEEE, (2021)SpeechLM: Enhanced Speech Pre-Training With Unpaired Textual Data., , , , , , , , , и 1 other автор(ы). IEEE ACM Trans. Audio Speech Lang. Process., (2024)Fragility Index: A New Approach for Binary Classification., , , , , , , , , и . KDD, стр. 2918-2929. ACM, (2023)Joint Pre-Training with Speech and Bilingual Text for Direct Speech to Speech Translation., , , , , , , и . ICASSP, стр. 1-5. IEEE, (2023)Neural Codec Language Models are Zero-Shot Text to Speech Synthesizers., , , , , , , , , и 3 other автор(ы). CoRR, (2023)