Author of the publication

Flickr30k Entities: Collecting Region-to-Phrase Correspondences for Richer Image-to-Sentence Models.

, , , , , and . ICCV, page 2641-2649. IEEE Computer Society, (2015)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Text-to-image Editing by Image Information Removal., , , and . CoRR, (2023)Learning to Reason from General Concepts to Fine-grained Tokens for Discriminative Phrase Detection., and . CoRR, (2021)From Fake to Real (FFR): A two-stage training pipeline for mitigating spurious correlations with synthetic data., , and . CoRR, (2023)Collecting The Puzzle Pieces: Disentangled Self-Driven Human Pose Transfer by Permuting Textures., , and . CoRR, (2022)MixtureGrowth: Growing Neural Networks by Recombining Learned Parameters., , , and . CoRR, (2023)Show and Write: Entity-aware News Generation with Image Information., , and . CoRR, (2021)CDS: Cross-Domain Self-supervised Pre-training., , , , , and . ICCV, page 9103-9112. IEEE, (2021)Detecting Cross-Modal Inconsistency to Defend Against Neural Fake News., , and . EMNLP (1), page 2081-2106. Association for Computational Linguistics, (2020)Look at What I'm Doing: Self-Supervised Spatial Grounding of Narrations in Instructional Videos., , , , and . NeurIPS, page 14476-14487. (2021)A Dataset for Interactive Vision-Language Navigation with Unknown Command Feasibility., , , , , and . ECCV (8), volume 13668 of Lecture Notes in Computer Science, page 312-328. Springer, (2022)