Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

GPT-4V in Wonderland: Large Multimodal Models for Zero-Shot Smartphone GUI Navigation., , , , , , , , , and 2 other author(s). CoRR, (2023)Diagnosing Vision-and-Language Navigation: What Really Matters., , , , , , , , and . NAACL-HLT, page 5981-5993. Association for Computational Linguistics, (2022)Automatic Layout Planning for Visually-Rich Documents with Instruction-Following Models., , , , and . CoRR, (2024)Large Language Models Are Implicitly Topic Models: Explaining and Finding Good Demonstrations for In-Context Learning., , and . CoRR, (2023)End-to-end Dense Video Captioning as Sequence Generation., , , , and . COLING, page 5651-5665. International Committee on Computational Linguistics, (2022)Towards Understanding Sample Variance in Visually Grounded Language Generation: Evaluations and Observations., , , , , and . EMNLP (1), page 8806-8811. Association for Computational Linguistics, (2020)Texar: A Modularized, Versatile, and Extensible Toolkit for Text Generation., , , , , , , , , and 5 other author(s). ACL (3), page 159-164. Association for Computational Linguistics, (2019)LayoutGPT: Compositional Visual Planning and Generation with Large Language Models., , , , , , , , and . CoRR, (2023)Multimodal Text Style Transfer for Outdoor Vision-and-Language Navigation., , , , , , , and . EACL, page 1207-1221. Association for Computational Linguistics, (2021)List Items One by One: A New Data Source and Learning Paradigm for Multimodal LLMs., , , , , , , , , and 1 other author(s). CoRR, (2024)