Author of the publication

Improving Vision-and-Language Navigation with Image-Text Pairs from the Web.

, , , , , and . ECCV (6), volume 12351 of Lecture Notes in Computer Science, page 259-274. Springer, (2020)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Emergence of Compositional Language with Deep Generational Transmission., , , , and . CoRR, (2019)Graph R-CNN for Scene Graph Generation., , , , and . ECCV (1), volume 11205 of Lecture Notes in Computer Science, page 690-706. Springer, (2018)DD-PPO: Learning Near-Perfect PointGoal Navigators from 2.5 Billion Frames., , , , , , , and . ICLR, OpenReview.net, (2020)Are We Making Real Progress in Simulated Environments? Measuring the Sim2Real Gap in Embodied Visual Navigation., , , , , , , , and . CoRR, (2019)12-in-1: Multi-Task Vision and Language Representation Learning., , , , and . CoRR, (2019)Diverse Beam Search for Improved Description of Complex Scenes., , , , , , and . AAAI, page 7371-7379. AAAI Press, (2018)Sim-to-Real Transfer for Vision-and-Language Navigation., , , , , , and . CoRL, volume 155 of Proceedings of Machine Learning Research, page 671-681. PMLR, (2020)12-in-1: Multi-Task Vision and Language Representation Learning., , , , and . CVPR, page 10434-10443. Computer Vision Foundation / IEEE, (2020)Dialog without Dialog Data: Learning Visual Dialog Agents from VQA Data., , , , , and . NeurIPS, (2020)SOAT: A Scene- and Object-Aware Transformer for Vision-and-Language Navigation., , , , and . NeurIPS, page 7357-7367. (2021)