Author of the publication

Large-Scale Self-Supervised Speech Representation Learning for Automatic Speaker Verification.

, , , , , , , and . ICASSP, page 6147-6151. IEEE, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Prompting Large Language Models for Zero-Shot Domain Adaptation in Speech Recognition., , , and . ASRU, page 1-8. IEEE, (2023)Optimizing Alignment of Speech and Language Latent Spaces for End-To-End Speech Recognition and Understanding., , , , , , and . ICASSP, page 7802-7806. IEEE, (2022)Improving Noise Robustness of Contrastive Speech Representation Learning with Speech Reconstruction., , , , , , , , and . ICASSP, page 6062-6066. IEEE, (2022)Neural Speech Synthesis with Transformer Network., , , , and . AAAI, page 6706-6713. AAAI Press, (2019)Semantic Mask for Transformer based End-to-End Speech Recognition., , , , , , , , , and . CoRR, (2019)RALL-E: Robust Codec Language Modeling with Chain-of-Thought Prompting for Text-to-Speech Synthesis., , , , , , , , , and 1 other author(s). CoRR, (2024)Microsoft Speaker Diarization System for the VoxCeleb Speaker Recognition Challenge 2020., , , , , , , , , and 3 other author(s). CoRR, (2020)On decoder-only architecture for speech-to-text and large language model integration., , , , , , , , , and 1 other author(s). CoRR, (2023)Semantic Mask for Transformer Based End-to-End Speech Recognition., , , , , , , , , and . INTERSPEECH, page 971-975. ISCA, (2020)Style Transfer as Unsupervised Machine Translation., , , , , , , and . CoRR, (2018)