Author of the publication

End-to-End Learning of Visual Representations From Uncurated Instructional Videos.

, , , , , and . CVPR, page 9876-9886. Computer Vision Foundation / IEEE, (2020)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Controllable Attention for Structured Layered Video Decomposition., , , and . ICCV, page 5733-5742. IEEE, (2019)Zorro: the masked multimodal transformer., , , , , , , , , and 1 other author(s). CoRR, (2023)End-to-End Learning of Visual Representations from Uncurated Instructional Videos., , , , , and . CoRR, (2019)Thinking Fast and Slow: Efficient Text-to-Visual Retrieval With Transformers., , , , and . CVPR, page 9826-9836. Computer Vision Foundation / IEEE, (2021)End-to-End Learning of Visual Representations From Uncurated Instructional Videos., , , , , and . CVPR, page 9876-9886. Computer Vision Foundation / IEEE, (2020)Perceiver IO: A General Architecture for Structured Inputs & Outputs., , , , , , , , , and 5 other author(s). CoRR, (2021)Multi-Task Learning of Object States and State-Modifying Actions From Web Videos., , , , and . IEEE Trans. Pattern Anal. Mach. Intell., 46 (7): 5114-5130 (2024)Learning to Segment Actions from Observation and Narration., , , , , and . ACL, page 2569-2588. Association for Computational Linguistics, (2020)Flamingo: a Visual Language Model for Few-Shot Learning., , , , , , , , , and 17 other author(s). NeurIPS, (2022)Are Labels Required for Improving Adversarial Robustness?, , , , , and . NeurIPS, page 12192-12202. (2019)