Author of the publication

Fully-Hierarchical Fine-Grained Prosody Modeling For Interpretable Speech Synthesis.

ICASSP, page 6264-6268. IEEE, (2020)


Other publications of authors with the same name

Speech-based Slot Filling using Large Language Models. CoRR, (2023)

Transformer Language Models with LSTM-Based Cross-Utterance Information Representation. ICASSP, page 7363-7367. IEEE, (2021)

Building Better AI Agents: A Provocation on the Utilisation of Persona in LLM-based Conversational Agents. CoRR, (2024)

Parameter Efficient Finetuning for Speech Emotion Recognition and Domain Adaptation. ICASSP, page 10986-10990. IEEE, (2024)

Speech-based Slot Filling using Large Language Models. ACL (Findings), page 6351-6362. Association for Computational Linguistics, (2024)

Tree-constrained Pointer Generator with Graph Neural Network Encodings for Contextual Speech Recognition. INTERSPEECH, page 2043-2047. ISCA, (2022)

M3AV: A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset. CoRR, (2024)

SALMONN: Towards Generic Hearing Abilities for Large Language Models. ICLR, OpenReview.net, (2024)

Enhancing Quantised End-to-End ASR Models Via Personalisation. ICASSP, page 12426-12430. IEEE, (2024)

Fine-grained Audio-Visual Joint Representations for Multimodal Large Language Models. CoRR, (2023)