Author of the publication

Multimodal Graph Transformer for Multimodal Question Answering.

, and . EACL, page 189-200. Association for Computational Linguistics, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

COVID-CT-Dataset: A CT Scan Dataset about COVID-19., , , and . CoRR, (2020)Mastering Robot Manipulation with Multimodal Prompts through Pretraining and Multi-task Fine-tuning., , , , , , , , and . CoRR, (2023)Towards Visual Question Answering on Pathology Images., , , , , , and . ACL/IJCNLP (2), page 708-718. Association for Computational Linguistics, (2021)Worse than Random? An Embarrassingly Simple Probing Evaluation of Large Multimodal Models in Medical VQA., , , and . CoRR, (2024)Learned Turbo-type Affine Rank Minimization., , and . WCSP, page 1-7. IEEE, (2019)Training-Free Structured Diffusion Guidance for Compositional Text-to-Image Synthesis., , , , , , , , and . ICLR, OpenReview.net, (2023)PathVQA: 30000+ Questions for Medical Visual Question Answering., , , , and . CoRR, (2020)ComCLIP: Training-Free Compositional Image and Text Matching., , , and . NAACL-HLT, page 6639-6659. Association for Computational Linguistics, (2024)CPL: Counterfactual Prompt Learning for Vision and Language Models., , , , , , , , , and . CoRR, (2022)Discriminative Diffusion Models as Few-shot Vision and Language Learners., , , , , , , , and . CoRR, (2023)