Author of the publication

Thinking Fast and Slow: Efficient Text-to-Visual Retrieval With Transformers.

, , , , and . CVPR, page 9826-9836. Computer Vision Foundation / IEEE, (2021)

Other publications of authors with the same name

Three ways to improve feature alignment for open vocabulary detection., , , , , and . CoRR, (2023)

Zorro: the masked multimodal transformer., , , , , , , , , and 1 other author(s). CoRR, (2023)

Controllable Attention for Structured Layered Video Decomposition., , , and . ICCV, page 5733-5742. IEEE, (2019)

Decoupling the Role of Data, Attention, and Losses in Multimodal Transformers., , , , and . Trans. Assoc. Comput. Linguistics, (2021)

Gemini: A Family of Highly Capable Multimodal Models., , , , , , , , , and 42 other author(s). CoRR, (2023)

End-to-End Learning of Visual Representations from Uncurated Instructional Videos., , , , , and . CoRR, (2019)

End-to-End Learning of Visual Representations From Uncurated Instructional Videos., , , , , and . CVPR, page 9876-9886. Computer Vision Foundation / IEEE, (2020)

Perceiver IO: A General Architecture for Structured Inputs & Outputs., , , , , , , , , and 5 other author(s). CoRR, (2021)

Multi-Task Learning of Object States and State-Modifying Actions From Web Videos., , , , and . IEEE Trans. Pattern Anal. Mach. Intell., 46 (7): 5114-5130 (2024)

Thinking Fast and Slow: Efficient Text-to-Visual Retrieval With Transformers., , , , and . CVPR, page 9826-9836. Computer Vision Foundation / IEEE, (2021)