Author of the publication

Building an Open-Vocabulary Video CLIP Model With Better Architectures, Optimization and Data.

, , , , , , and . IEEE Trans. Pattern Anal. Mach. Intell., 46 (7): 4747-4762 (2024)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Semi-supervised Vision Transformers., , , , and . ECCV (30), volume 13690 of Lecture Notes in Computer Science, page 605-620. Springer, (2022)Cross-domain Contrastive Learning for Unsupervised Domain Adaptation., , , , , and . CoRR, (2021)AID: Adapting Image2Video Diffusion Models for Instruction-guided Video Prediction., , , , and . CoRR, (2024)Semi-Supervised Vision Transformers., , , , and . CoRR, (2021)To See is to Believe: Prompting GPT-4V for Better Visual Instruction Tuning., , , , , and . CoRR, (2023)VideoLT: Large-scale Long-tailed Video Recognition., , , , , , and . ICCV, page 7940-7949. IEEE, (2021)Building an Open-Vocabulary Video CLIP Model With Better Architectures, Optimization and Data., , , , , , and . IEEE Trans. Pattern Anal. Mach. Intell., 46 (7): 4747-4762 (2024)BMB: Balanced Memory Bank for Imbalanced Semi-supervised Learning., , , and . CoRR, (2023)A Multimodal Framework for Video Ads Understanding., , , , and . ACM Multimedia, page 4843-4847. ACM, (2021)Open-VCLIP: Transforming CLIP to an Open-vocabulary Video Model via Interpolated Weight Optimization., , , , and . ICML, volume 202 of Proceedings of Machine Learning Research, page 36978-36989. PMLR, (2023)