Author of the publication

Do Vision-and-Language Transformers Learn Grounded Predicate-Noun Dependencies?

, , , , and . EMNLP, page 1538-1555. Association for Computational Linguistics, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Explaining First Impressions: Modeling, Recognizing, and Explaining Apparent Personality from Videos., , , , , , , , , and 7 other author(s). CoRR, (2018)Video Corpus Annotation Using Active Learning., and . ECIR, volume 4956 of Lecture Notes in Computer Science, page 187-198. Springer, (2008)Deep Networks with Adaptive Nyström Approximation., , , and . CoRR, (2019)ChaLearn Looking at People: Inpainting and Denoising challenges., , , , , , , , and . CoRR, (2021)Automatic facial expressions, gaze direction and head movements generation of a virtual agent., , and . ICMI Companion, page 79-88. ACM, (2022)Towards the generation of synchronized and believable non-verbal facial behaviors of a talking virtual agent., , , , and . ICMI Companion, page 228-237. ACM, (2023)PSM-nets: Compressing Neural Networks with Product of Sparse Matrices., , , , and . IJCNN, page 1-8. IEEE, (2021)Are Vision-Language Transformers Learning Multimodal Representations? A Probing Perspective., , , and . AAAI, page 11248-11257. AAAI Press, (2022)Implicit Regularization with Polynomial Growth in Deep Tensor Factorization., , , , and . ICML, volume 162 of Proceedings of Machine Learning Research, page 8484-8501. PMLR, (2022)Efficient image concept indexing by harmonic & arithmetic profiles entropy., , and . ICIP, page 277-280. IEEE, (2009)