Author of the publication

Localization vs. Semantics: Visual Representations in Unimodal and Multimodal Models.

EACL (1), pages 2378-2390. Association for Computational Linguistics, (2024)

Other publications of authors with the same name

Localization vs. Semantics: How Can Language Benefit Visual Representation Learning? CoRR, (2022)
Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models. CoRR, (2023)
SwapMix: Diagnosing and Regularizing the Over-Reliance on Visual Context in Visual Question Answering. CVPR, pages 5068-5078. IEEE, (2022)
3D-Aware Visual Question Answering about Parts, Poses and Occlusions. CoRR, (2023)
Visual Commonsense in Pretrained Unimodal and Multimodal Models. NAACL-HLT, pages 5321-5335. Association for Computational Linguistics, (2022)
Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning. CoRR, (2022)
Calibrating Concepts and Operations: Towards Symbolic Reasoning on Real Images. ICCV, pages 14890-14899. IEEE, (2021)
Localization vs. Semantics: Visual Representations in Unimodal and Multimodal Models. EACL (1), pages 2378-2390. Association for Computational Linguistics, (2024)
Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning. CVPR, pages 14963-14973. IEEE, (2023)
Context-Aware Group Captioning via Self-Attention and Contrastive Features. CVPR, pages 3437-3447. Computer Vision Foundation / IEEE, (2020)