
Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed.

 

Other publications by persons with the same name

SALMONN: Towards Generic Hearing Abilities for Large Language Models. CoRR, (2023)
Connecting Speech Encoder and Large Language Model for ASR. CoRR, (2023)
Connecting Speech Encoder and Large Language Model for ASR. ICASSP, pp. 12637-12641. IEEE, (2024)
M³AV: A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset. ACL (1), pp. 9041-9060. Association for Computational Linguistics, (2024)
A method of band selection of remote sensing image based on clustering and intra-class index. Multim. Tools Appl., 81 (16): 22111-22128 (2022)
HMDN: Hierarchical Multi-Distribution Network for Click-Through Rate Prediction. CoRR, (2024)
T2T-YAO: A Telomere-to-Telomere Assembled Diploid Reference Genome for Han Chinese (and 32 other authors). Genom. Proteom. Bioinform., 21 (6): 1085-1100 (2023)
Can Large Language Models Understand Spatial Audio? (and 1 other author). CoRR, (2024)
Extending Large Language Models for Speech and Audio Captioning. ICASSP, pp. 11236-11240. IEEE, (2024)
video-SALMONN: Speech-Enhanced Audio-Visual Large Language Models. ICML, OpenReview.net, (2024)