Author of the publication

Cross-Modal Discrete Representation Learning.

, , , , , and . ACL (1), page 3013-3035. Association for Computational Linguistics, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Non-Autoregressive Predictive Coding for Learning Speech Representations from Local Dependencies., , and . CoRR, (2020)Contrastive Audio-Visual Masked Autoencoder., , , , , , and . CoRR, (2022)On the Interplay Between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis., , , , , , , , , and 1 other author(s). CoRR, (2021)Sequence-to-sequence Automatic Speech Recognition with Word Embedding Regularization and Fused Decoding., , , , and . CoRR, (2019)Semi-Supervised Learning for Multi-Speaker Text-to-Speech Synthesis Using Discrete Speech Representation., , , and . INTERSPEECH, page 3191-3195. ISCA, (2020)Towards End-to-End Unsupervised Speech Recognition., , , and . SLT, page 221-228. IEEE, (2022)On the Interplay between Sparsity, Naturalness, Intelligibility, and Prosody in Speech Synthesis., , , , , , , , , and 1 other author(s). ICASSP, page 8447-8451. IEEE, (2022)Non-Autoregressive Predictive Coding for Learning Speech Representations from Local Dependencies., , and . Interspeech, page 3730-3734. ISCA, (2021)Listen, Think, and Understand., , , , and . CoRR, (2023)Codec-SUPERB: An In-Depth Analysis of Sound Codec Models., , , , , , , , , and . CoRR, (2024)