Author of the publication

VL-InterpreT: An Interactive Visualization Tool for Interpreting Vision-Language Transformers.

CVPR, pages 21374-21383. IEEE, (2022)

Other publications of authors with the same name

Opinion-based Relational Pivoting for Cross-domain Aspect Term Extraction. WASSA@ACL, pages 104-112. Association for Computational Linguistics, (2022)

Improving Video Retrieval Using Multilingual Knowledge Transfer. ECIR (1), volume 13980 of Lecture Notes in Computer Science, pages 669-684. Springer, (2023)

First Workshop on Knowledge Injection in Neural Networks (KINN). CIKM, pages 4882-4883. ACM, (2021)

Brain encoding models based on multimodal transformers can transfer across language and vision. CoRR, (2023)

Probing Intersectional Biases in Vision-Language Models with Counterfactual Examples. CoRR, (2023)

KD-VLP: Improving End-to-End Vision-and-Language Pretraining with Object Knowledge Distillation. CoRR, (2021)

LDM3D: Latent Diffusion Model for 3D. CoRR, (2023)

Is Multimodal Vision Supervision Beneficial to Language? CVPR Workshops, pages 2637-2642. IEEE, (2023)

KD-VLP: Improving End-to-End Vision-and-Language Pretraining with Object Knowledge Distillation. NAACL-HLT (Findings), pages 1589-1600. Association for Computational Linguistics, (2022)

NeuroPrompts: An Adaptive Framework to Optimize Prompts for Text-to-Image Generation. CoRR, (2023)