Author of the publication

Multimodal Graph Transformer for Multimodal Question Answering.

, and . EACL, page 189-200. Association for Computational Linguistics, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Language-Driven Image Style Transfer., , and . CoRR, (2021)Evaluating Multi-Agent Coordination Abilities in Large Language Models., , and . CoRR, (2023)Assessing Multilingual Fairness in Pre-trained Multimodal Representations., , and . ACL (Findings), page 2681-2695. Association for Computational Linguistics, (2022)Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis., , , , , , , , and . ICLR, OpenReview.net, (2023)SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing., , , , , , , , , and . CoRR, (2024)PHOTOSWAP: Personalized Subject Swapping in Images., , , , , , , , , and 1 other author(s). NeurIPS, (2023)MiniGPT-5: Interleaved Vision-and-Language Generation via Generative Vokens., , and . CoRR, (2023)Visual Question Rewriting for Increasing Response Rate., , , and . SIGIR, page 2071-2075. ACM, (2021)Are Gender-Neutral Queries Really Gender-Neutral? Mitigating Gender Bias in Image Search., , and . EMNLP (1), page 1995-2008. Association for Computational Linguistics, (2021)FedVLN: Privacy-Preserving Federated Vision-and-Language Navigation., and . ECCV (36), volume 13696 of Lecture Notes in Computer Science, page 682-699. Springer, (2022)