Author of the publication

Understanding Image and Text Simultaneously: a Dual Vision-Language Machine Comprehension Task.

, , , and . CoRR, (2016)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

PaLI-3 Vision Language Models: Smaller, Faster, Stronger., , , , , , , , , and 9 other author(s). CoRR, (2023)PreSTU: Pre-Training for Scene-Text Understanding., , , , , , and . CoRR, (2022)Understanding Guided Image Captioning Performance across Domains., , , and . CoRR, (2020)Understanding Image and Text Simultaneously: a Dual Vision-Language Machine Comprehension Task., , , and . CoRR, (2016)Multimodal Pretraining for Dense Video Captioning., , , , and . AACL/IJCNLP, page 470-490. Association for Computational Linguistics, (2020)The SDL Language Weaver Systems in the WMT12 Quality Estimation Shared Task., , and . WMT@NAACL-HLT, page 145-151. The Association for Computer Linguistics, (2012)978-1-937284-20-6.Denoising Large-Scale Image Captioning from Alt-text Data Using Content Selection Models., , , , and . COLING, page 6089-6104. International Committee on Computational Linguistics, (2022)Cross-modal Language Generation using Pivot Stabilization for Web-scale Language Coverage., and . ACL, page 160-170. Association for Computational Linguistics, (2020)Natural Language Generation for Text-to-Text Applications Using an Information-Slim Representation.. AAAI, page 1662-1663. AAAI Press / The MIT Press, (2005)Gemini 1.5: Unlocking multimodal understanding across millions of tokens of context., , , , , , , , , and 43 other author(s). CoRR, (2024)