From post

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed.

 

Other publications of persons with the same name

Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning. CVPR, pp. 10714-10726. IEEE, (2023)
Look Before You Speak: Visually Contextualized Utterances. CVPR, pp. 16877-16887. Computer Vision Foundation / IEEE, (2021)
Image Question Answering using Convolutional Neural Network with Dynamic Parameter Prediction. CoRR, (2015)
MarioQA: Answering Questions by Watching Gameplay Videos. CoRR, (2016)
Regularizing Neural Networks via Stochastic Branch Layers. ACML, volume 101 of Proceedings of Machine Learning Research, pp. 678-693. PMLR, (2019)
Learning Correlation Structures for Vision Transformers. CoRR, (2024)
Reinforcing an Image Caption Generator Using Off-Line Human Feedback. AAAI, pp. 2693-2700. AAAI Press, (2020)
AVFormer: Injecting Vision into Frozen Speech Models for Zero-Shot AV-ASR. CVPR, pp. 22922-22931. IEEE, (2023)
Learning for Single-Shot Confidence Calibration in Deep Neural Networks Through Stochastic Inferences. CVPR, pp. 9030-9038. Computer Vision Foundation / IEEE, (2019)
Learning Audio-Video Modalities from Image Captions. ECCV (14), volume 13674 of Lecture Notes in Computer Science, pp. 407-426. Springer, (2022)