From post

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

TRIPS: Efficient Vision-and-Language Pre-training with Text-Relevant Image Patch Selection., , , , , , , и . EMNLP, стр. 4084-4096. Association for Computational Linguistics, (2022)Similarity Learning For Cover Song Identification Using Cross-Similarity Matrices of Multi-Level Deep Sequences., , и . ICASSP, стр. 26-30. IEEE, (2020)BUS : Efficient and Effective Vision-language Pre-training with Bottom-Up Patch Summarization., , , , , , , , , и . ICCV, стр. 2888-2898. IEEE, (2023)TiMix: Text-Aware Image Mixing for Effective Vision-Language Pre-training., , , , , , и . AAAI, стр. 2489-2497. AAAI Press, (2024)Efficient Vision-and-Language Pre-training with Text-Relevant Image Patch Selection., , , , , , , , и . CoRR, (2024)COPA : Efficient Vision-Language Pre-training through Collaborative Object- and Patch-Text Alignment., , , , , , , , , и . ACM Multimedia, стр. 4480-4491. ACM, (2023)Learn A Robust Representation For Cover Song Identification Via Aggregating Local And Global Music Temporal Context., , и . ICME, стр. 1-6. IEEE, (2020)Vision Language Pre-training by Contrastive Learning with Cross-Modal Similarity Regulation., , , , , и . ACL (1), стр. 14660-14679. Association for Computational Linguistics, (2023)Exploiting Pseudo Image Captions for Multimodal Summarization., , , , и . ACL (Findings), стр. 161-175. Association for Computational Linguistics, (2023)