Author of the publication

Unified Modeling of Multi-Talker Overlapped Speech Recognition and Diarization with a Sidecar Separator.

INTERSPEECH, page 3467-3471. ISCA, (2023)

Other publications of authors with the same name

Multi-distribution deep belief network for speech synthesis. ICASSP, page 8012-8016. IEEE, (2013)

Design and Collection of an L2 English Corpus with a Suprasegmental Focus for Chinese Learners of English. ICPhS, page 1210-1213. (2011)

Acoustic to articulatory mapping with deep neural network. Multim. Tools Appl., 74 (22): 9889-9907 (2015)

MSMC-TTS: Multi-Stage Multi-Codebook VQ-VAE Based Neural TTS. IEEE ACM Trans. Audio Speech Lang. Process., (2023)

SnakeGAN: A Universal Vocoder Leveraging DDSP Prior Knowledge and Periodic Inductive Bias. ICME, page 1703-1708. IEEE, (2023)

TalkTive: A Conversational Agent Using Backchannels to Engage Older Adults in Neurocognitive Disorders Screening. CHI, page 304:1-304:19. ACM, (2022)

Push-Pull: Characterizing the Adversarial Robustness for Audio-Visual Active Speaker Detection. SLT, page 692-699. IEEE, (2022)

A Cantonese Speech-Driven Talking Face Using Translingual Audio-to-Visual Conversion. ISCSLP, volume 4274 of Lecture Notes in Computer Science, page 627-639. Springer, (2006)

User Satisfaction Estimation with Sequential Dialogue Act Modeling in Goal-oriented Conversational Systems. WWW, page 2998-3008. ACM, (2022)

Towards Expressive Speaking Style Modelling with Hierarchical Context Information for Mandarin Speech Synthesis. ICASSP, page 7922-7926. IEEE, (2022)