Author of the publication

TVQA: Localized, Compositional Video Question Answering.

, , , and . EMNLP, page 1369-1379. Association for Computational Linguistics, (2018)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Dance Dance Generation: Motion Transfer for Internet Videos., , , , and . ICCV Workshops, page 1208-1216. IEEE, (2019)VALUE: A Multi-Task Benchmark for Video-and-Language Understanding Evaluation., , , , , , , , , and 5 other author(s). NeurIPS Datasets and Benchmarks, (2021)ReferItGame: Referring to Objects in Photographs of Natural Scenes., , , and . EMNLP, page 787-798. ACL, (2014)Revealing Single Frame Bias for Video-and-Language Learning., , and . ACL (1), page 487-507. Association for Computational Linguistics, (2023)Parsing clothing in fashion photographs., , , and . CVPR, page 3570-3577. IEEE Computer Society, (2012)Who are you with and where are you going?, , , and . CVPR, page 1345-1352. IEEE Computer Society, (2011)Automatic Attribute Discovery and Characterization from Noisy Web Data., , and . ECCV (1), volume 6311 of Lecture Notes in Computer Science, page 663-676. Springer, (2010)CommerceMM: Large-Scale Commerce MultiModal Representation Learning with Omni Retrieval., , , , , , and . KDD, page 4433-4442. ACM, (2022)Visual to Sound: Generating Natural Sound for Videos in the Wild., , , , and . CVPR, page 3550-3558. Computer Vision Foundation / IEEE Computer Society, (2018)Auto-Illustrating Poems and Songs with Style., , and . ACCV (4), volume 10114 of Lecture Notes in Computer Science, page 87-103. Springer, (2016)