Author of the publication

UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding.

, , , , , and . ACL (Findings), page 778-793. Association for Computational Linguistics, (2023)

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to the name to display some publications already assigned to that person.

Other publications of authors with the same name

UniFine: A Unified and Fine-grained Approach for Zero-shot Vision-Language Understanding., , , , , and . ACL (Findings), page 778-793. Association for Computational Linguistics, (2023)

i-Code V2: An Autoregressive Generation Framework over Vision, Language, and Speech Data., , , , , , , , , and 9 other author(s). NAACL-HLT (Findings), page 1615-1627. Association for Computational Linguistics, (2024)

MAIRA-1: A specialised large multimodal model for radiology report generation., , , , , , , , , and 5 other author(s). CoRR, (2023)

A patient-centric dataset of images and metadata for identifying melanomas using clinical context, , , , , , , , , and 15 other author(s). Scientific Data, (January 2021)

i-Code: An Integrative and Composable Multimodal Learning Framework., , , , , , , , , and 10 other author(s). AAAI, page 10880-10890. AAAI Press, (2023)

Streaming Video Model., , , , , and . CVPR, page 14602-14612. IEEE, (2023)

Deep Learning, Sparse Coding, and SVM for Melanoma Recognition in Dermoscopy Images., , , , , and . MLMI, volume 9352 of Lecture Notes in Computer Science, page 118-126. Springer, (2015)

Generative Enhancement for 3D Medical Images., , , , , and . CoRR, (2024)

Fully Authentic Visual Question Answering Dataset from Online Communities., , , , , and . CoRR, (2023)

CvT: Introducing Convolutions to Vision Transformers., , , , , , and . ICCV, page 22-31. IEEE, (2021)