Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Localization vs. Semantics: Visual Representations in Unimodal and Multimodal Models.

Z. Li, C. Xie, B. Durme, and A. Yuille. EACL (1), page 2378-2390. Association for Computational Linguistics, (2024)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Li Li

Other publications of authors with the same name

Localization vs. Semantics: How Can Language Benefit Visual Representation Learning?Z. Li, C. Xie, B. Durme, and A. Yuille. CoRR, (2022)Causal-CoG: A Causal-Effect Look at Context Generation for Boosting Multi-modal Language Models.S. Zhao, Z. Li, Y. Lu, A. Yuille, and Y. Wang. CoRR, (2023)SwapMix: Diagnosing and Regularizing the Over-Reliance on Visual Context in Visual Question Answering.V. Gupta, Z. Li, A. Kortylewski, C. Zhang, Y. Li, and A. Yuille. CVPR, page 5068-5078. IEEE, (2022)3D-Aware Visual Question Answering about Parts, Poses and Occlusions.X. Wang, W. Ma, Z. Li, A. Kortylewski, and A. Yuille. CoRR, (2023)Visual Commonsense in Pretrained Unimodal and Multimodal Models.C. Zhang, B. Durme, Z. Li, and E. Stengel-Eskin. NAACL-HLT, page 5321-5335. Association for Computational Linguistics, (2022)Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning.Z. Li, X. Wang, E. Stengel-Eskin, A. Kortylewski, W. Ma, B. Durme, and A. Yuille. CoRR, (2022)Calibrating Concepts and Operations: Towards Symbolic Reasoning on Real Images.Z. Li, E. Stengel-Eskin, Y. Zhang, C. Xie, Q. Tran, B. Durme, and A. Yuille. ICCV, page 14890-14899. IEEE, (2021)Localization vs. Semantics: Visual Representations in Unimodal and Multimodal Models.Z. Li, C. Xie, B. Durme, and A. Yuille. EACL (1), page 2378-2390. Association for Computational Linguistics, (2024)Super-CLEVR: A Virtual Benchmark to Diagnose Domain Robustness in Visual Reasoning.Z. Li, X. Wang, E. Stengel-Eskin, A. Kortylewski, W. Ma, B. Durme, and A. Yuille. CVPR, page 14963-14973. IEEE, (2023)Context-Aware Group Captioning via Self-Attention and Contrastive Features.Z. Li, Q. Tran, L. Mai, Z. Lin, and A. Yuille. CVPR, page 3437-3447. Computer Vision Foundation / IEEE, (2020)

BibSonomy

Disambiguation of "Li, Zhuowan"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Localization vs. Semantics: Visual Representations in Unimodal and Multimodal Models.

Please choose a person to relate this publication to

Li Li

Li Li

Li Li

Li Li

Li Li

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Li, Zhuowan"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Localization vs. Semantics: Visual Representations in Unimodal and Multimodal Models.

Please choose a person to relate this publication to

Li Li

Li Li

Li Li

Li Li

Li Li

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Localization vs. Semantics: Visual Representations in Unimodal and Multimodal Models.