@dblp

Two Effects, One Trigger: On the Modality Gap, Object Bias, and Information Imbalance in Contrastive Vision-Language Representation Learning.

, , , , and . CoRR, (2024)

Links and resources

Tags