Author of the publication

X-InstructBLIP: A Framework for aligning X-Modal instruction-aware representations to LLMs and Emergent Cross-modal Reasoning.

CoRR, (2023)

Other publications of authors with the same name

A General Framework for Tracking Multiple People from a Moving Camera. IEEE Trans. Pattern Anal. Mach. Intell., 35 (7): 1577-1591 (2013)
iGibson, a Simulation Environment for Interactive Tasks in Large Realistic Scenes. CoRR, (2020)
Semantic and Geometric Modeling with Neural Message Passing in 3D Scene Graphs for Hierarchical Mechanical Search. CoRR, (2020)
Which Tasks Should Be Learned Together in Multi-task Learning? CoRR, (2019)
Topological Planning with Transformers for Vision-and-Language Navigation. CoRR, (2020)
Sonicverse: A Multisensory Simulation Platform for Embodied Household Agents that See and Hear. ICRA, pages 704-711. IEEE, (2023)
Robust Policies via Mid-Level Visual Representations: An Experimental Study in Manipulation and Navigation. CoRL, volume 155 of Proceedings of Machine Learning Research, pages 2328-2346. PMLR, (2020)
A Behavioral Approach to Visual Navigation with Graph Localization Networks. Robotics: Science and Systems, (2019)
Taskonomy: Disentangling Task Transfer Learning. CVPR, pages 3712-3722. Computer Vision Foundation / IEEE Computer Society, (2018)
Social GAN: Socially Acceptable Trajectories With Generative Adversarial Networks. CVPR, pages 2255-2264. Computer Vision Foundation / IEEE Computer Society, (2018)