From post

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

GIT: A Generative Image-to-text Transformer for Vision and Language., , , , , , , , и . Trans. Mach. Learn. Res., (2022)Multiple Z-Complementary Code Sets With Low Inter-Set Cross-Correlation., , , и . IWSDA, стр. 1-5. IEEE, (2022)Meta Module Network for Compositional Visual Reasoning., , , , , и . WACV, стр. 655-664. IEEE, (2021)Idea2Img: Iterative Self-Refinement with GPT-4V(ision) for Automatic Image Design and Generation., , , , , , и . CoRR, (2023)DEsignBench: Exploring and Benchmarking DALL-E 3 for Imagining Visual Design., , , , и . CoRR, (2023)Extracting Human Face Similarity Judgments: Pairs or Triplets?, , , и . CogSci, cognitivesciencesociety.org, (2016)Multi-step Reasoning via Recurrent Dual Attention for Visual Dialog., , , , , и . ACL (1), стр. 6463-6474. Association for Computational Linguistics, (2019)An Empirical Study of Multimodal Model Merging., , , , , и . EMNLP (Findings), стр. 1563-1575. Association for Computational Linguistics, (2023)MM-Vet: Evaluating Large Multimodal Models for Integrated Capabilities., , , , , , , и . ICML, OpenReview.net, (2024)Generalized Decoding for Pixel, Image, and Language., , , , , , , , , и 4 other автор(ы). CoRR, (2022)