Author of the publication

Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language Models.

NAACL-HLT, page 2443-2459. Association for Computational Linguistics, (2021)

Other publications of authors with the same name

Multi-modal Self-Supervision from Generalized Data Transformations. CoRR, (2020)

Multilingual Multimodal Pre-training for Zero-Shot Cross-Lingual Transfer of Vision-Language Models. NAACL-HLT, page 2443-2459. Association for Computational Linguistics, (2021)

On Compositions of Transformations in Contrastive Self-Supervised Learning. ICCV, page 9557-9567. IEEE, (2021)

Learning and interpreting deep representations from multi-modal data. University of Oxford, UK, (2021). British Library, EThOS.

Space-Time Crop & Attend: Improving Cross-modal Video Representation Learning. ICCV, page 10540-10552. IEEE, (2021)

Understanding Deep Networks via Extremal Perturbations and Smooth Masks. ICCV, page 2950-2958. IEEE, (2019)

Keeping Your Eye on the Ball: Trajectory Attention in Video Transformers. NeurIPS, page 12493-12506. (2021)

Labelling unlabelled videos from scratch with multi-modal self-supervision. NeurIPS, (2020)

Support-set bottlenecks for video-text representation learning. ICLR, OpenReview.net, (2021)