Author of the publication

S3T: Self-Supervised Pre-Training with Swin Transformer For Music Classification.

ICASSP, page 606-610. IEEE, (2022)

Other publications of authors with the same name

A Unified Sequence-to-Sequence Front-End Model for Mandarin Text-to-Speech Synthesis. ICASSP, page 6689-6693. IEEE, (2020)
Unsupervised training of subspace gaussian mixture models for conversational telephone speech recognition. ICASSP, page 4829-4832. IEEE, (2012)
Improving RNN transducer with normalized jointer network. CoRR, (2020)
Unsupervised Video Domain Adaptation: A Disentanglement Perspective. CoRR, (2022)
Improving Large-Scale Deep Biasing With Phoneme Features and Text-Only Data in Streaming Transducer. ASRU, page 1-8. IEEE, (2023)
A Chapter-Wise Understanding System for Text-To-Speech in Chinese Novels. ICASSP, page 6069-6073. IEEE, (2021)
PPG-Based Singing Voice Conversion with Adversarial Representation Learning. ICASSP, page 7073-7077. IEEE, (2021)
SALMONN: Towards Generic Hearing Abilities for Large Language Models. CoRR, (2023)
Make-An-Audio 2: Temporal-Enhanced Text-to-Audio Generation. CoRR, (2023)
BiFSMNv2: Pushing Binary Neural Networks for Keyword Spotting to Real-Network Performance. CoRR, (2022)