Author of the publication

Bytecover2: Towards Dimensionality Reduction of Latent Embedding for Efficient Cover Song Identification.

, , , , and . ICASSP, page 616-620. IEEE, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Unsupervised training of subspace gaussian mixture models for conversational telephone speech recognition., , and . ICASSP, page 4829-4832. IEEE, (2012)Improving RNN transducer with normalized jointer network., , , , , , , and . CoRR, (2020)Unsupervised Video Domain Adaptation: A Disentanglement Perspective., , , , , , and . CoRR, (2022)Improving Large-Scale Deep Biasing With Phoneme Features and Text-Only Data in Streaming Transducer., , , , , and . ASRU, page 1-8. IEEE, (2023)PPG-Based Singing Voice Conversion with Adversarial Representation Learning., , , , , , and . ICASSP, page 7073-7077. IEEE, (2021)A Chapter-Wise Understanding System for Text-To-Speech in Chinese Novels., , , , , and . ICASSP, page 6069-6073. IEEE, (2021)A Unified Sequence-to-Sequence Front-End Model for Mandarin Text-to-Speech Synthesis., , , , , , and . ICASSP, page 6689-6693. IEEE, (2020)Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation., , , , , , , , , and . CoRR, (2023)SALMONN: Towards Generic Hearing Abilities for Large Language Models., , , , , , , , and . CoRR, (2023)Improving Non-native Word-level Pronunciation Scoring with Phone-level Mixup Data Augmentation and Multi-source Information., , , , , and . CoRR, (2022)