Author of the publication

End-to-End Learning of Visual Representations From Uncurated Instructional Videos.

, , , , , and . CVPR, page 9876-9886. Computer Vision Foundation / IEEE, (2020)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

BodyNet: Volumetric Inference of 3D Human Body Shapes., , , , , , and . ECCV (7), volume 11211 of Lecture Notes in Computer Science, page 20-38. Springer, (2018)Learning to combine primitive skills: A step towards versatile robotic manipulation §., , , , , and . ICRA, page 4637-4643. IEEE, (2020)Margin based knowledge distillation for mobile face recognition., , and . ICMV, volume 11433 of SPIE Proceedings, page 114330O. SPIE, (2019)Think Global, Act Local: Dual-scale Graph Transformer for Vision-and-Language Navigation., , , , and . CVPR, page 16516-16526. IEEE, (2022)Learning from Unlabeled 3D Environments for Vision-and-Language Navigation., , , , and . ECCV (39), volume 13699 of Lecture Notes in Computer Science, page 638-655. Springer, (2022)MirrorCheck: Efficient Adversarial Defense for Vision-Language Models., , , , , , , and . CoRR, (2024)Density-aware person detection and tracking in crowds., , , and . ICCV, page 2423-2430. IEEE Computer Society, (2011)Just Ask: Learning to Answer Questions from Millions of Narrated Videos., , , , and . CoRR, (2020)GenHowTo: Learning to Generate Actions and State Transformations from Instructional Videos., , , , and . CVPR, page 6561-6571. IEEE, (2024)Monte-Carlo Tree Search for Efficient Visually Guided Rearrangement Planning., , , , , , and . CoRR, (2019)