From post

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

Towards a Complete Benchmark on Video Moment Localization., , , , , , , , , и . AISTATS, том 238 из Proceedings of Machine Learning Research, стр. 4168-4176. PMLR, (2024)CXR-CLIP: Toward Large Scale Chest X-ray Language-Image Pre-training., , , , , , , и . MICCAI (2), том 14221 из Lecture Notes in Computer Science, стр. 101-111. Springer, (2023)Sparse DETR: Efficient End-to-End Object Detection with Learnable Sparsity., , , и . ICLR, OpenReview.net, (2022)Accelerating Object Detection by Erasing Background Activations., , , и . CoRR, (2020)Noise-aware Learning from Web-crawled Image-Text Data for Image Captioning., , , и . ICCV, стр. 2930-2940. IEEE, (2023)MELTR: Meta Loss Transformer for Learning to Fine-tune Video Foundation Models., , , , , и . CVPR, стр. 20105-20115. IEEE, (2023)Large Language Models are Temporal and Causal Reasoners for Video Question Answering., , , , и . EMNLP, стр. 4300-4316. Association for Computational Linguistics, (2023)Efficient Multilingual Multi-modal Pre-training through Triple Contrastive Loss., , , , и . COLING, стр. 5730-5744. International Committee on Computational Linguistics, (2022)Learning to Generate Text-Grounded Mask for Open-World Semantic Segmentation from Only Image-Text Pairs., , и . CVPR, стр. 11165-11174. IEEE, (2023)Spatially Consistent Representation Learning., , , и . CVPR, стр. 1144-1153. Computer Vision Foundation / IEEE, (2021)