Author of the publication

Revisiting Multimodal Representation in Contrastive Learning: From Patch and Token Embeddings to Finite Discrete Tokens.

, , , , , , , and . CVPR, page 15095-15104. IEEE, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

CLIP-Adapter: Better Vision-Language Models with Feature Adapters., , , , , , , and . Int. J. Comput. Vis., 132 (2): 581-595 (February 2024)CLIP-Adapter: Better Vision-Language Models with Feature Adapters., , , , , , , and . CoRR, (2021)Multi-Layer Content Interaction Through Quaternion Product for Visual Question Answering., , , , , , and . ICASSP, page 4412-4416. IEEE, (2020)Semantic segmentation with multi-path refinement and pyramid pooling dilated-resnet., , , , , and . ICIP, page 3100-3104. IEEE, (2017)COMPOSER: Compositional Reasoning of Group Activity in Videos with Keypoint-Only Modality., , , , , , , , and . ECCV (35), volume 13695 of Lecture Notes in Computer Science, page 249-266. Springer, (2022)Dense Contrastive Visual-Linguistic Pretraining., , , , , , , and . ACM Multimedia, page 5203-5212. ACM, (2021)InfiCoder-Eval: Systematically Evaluating the Question-Answering Capabilities of Code Large Language Models., , , , , , , , , and . CoRR, (2024)Image Segmentation with Pyramid Dilated Convolution Based on ResNet and U-Net., , , , and . ICONIP (2), volume 10635 of Lecture Notes in Computer Science, page 364-372. Springer, (2017)Multi-Pass Transformer for Machine Translation., , , , and . CoRR, (2020)Counterfactual Evaluation for Explainable AI., , , , , , , , and . CoRR, (2021)