Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Learning Correlation Structures for Vision Transformers.

M. Kim, P. Seo, C. Schmid, and M. Cho. CoRR, (2024)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Eunyoung Seo

Paek Pyung Seon

Ean-Jeong Seo

Ki-Chang Seong

Bong-Seock Seo

Other publications of authors with the same name

Vid2Seq: Large-Scale Pretraining of a Visual Language Model for Dense Video Captioning.A. Yang, A. Nagrani, P. Seo, A. Miech, J. Pont-Tuset, I. Laptev, J. Sivic, and C. Schmid. CVPR, page 10714-10726. IEEE, (2023)Look Before You Speak: Visually Contextualized Utterances.P. Seo, A. Nagrani, and C. Schmid. CVPR, page 16877-16887. Computer Vision Foundation / IEEE, (2021)Image Question Answering using Convolutional Neural Network with Dynamic Parameter Prediction.H. Noh, P. Seo, and B. Han. CoRR, (2015)MarioQA: Answering Questions by Watching Gameplay Videos.J. Mun, P. Seo, I. Jung, and B. Han. CoRR, (2016)Regularizing Neural Networks via Stochastic Branch Layers.W. Park, P. Seo, B. Han, and M. Cho. ACML, volume 101 of Proceedings of Machine Learning Research, page 678-693. PMLR, (2019)Learning Correlation Structures for Vision Transformers.M. Kim, P. Seo, C. Schmid, and M. Cho. CoRR, (2024)Reinforcing an Image Caption Generator Using Off-Line Human Feedback.P. Seo, P. Sharma, T. Levinboim, B. Han, and R. Soricut. AAAI, page 2693-2700. AAAI Press, (2020)AVFormer: Injecting Vision into Frozen Speech Models for Zero-Shot AV-ASR.P. Seo, A. Nagrani, and C. Schmid. CVPR, page 22922-22931. IEEE, (2023)Learning for Single-Shot Confidence Calibration in Deep Neural Networks Through Stochastic Inferences.S. Seo, P. Seo, and B. Han. CVPR, page 9030-9038. Computer Vision Foundation / IEEE, (2019)Learning Audio-Video Modalities from Image Captions.A. Nagrani, P. Seo, B. Seybold, A. Hauth, S. Manen, C. Sun, and C. Schmid. ECCV (14), volume 13674 of Lecture Notes in Computer Science, page 407-426. Springer, (2022)

BibSonomy

Disambiguation of "Seo, Paul Hongsuck"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Learning Correlation Structures for Vision Transformers.

Please choose a person to relate this publication to

Eunyoung Seo

Paek Pyung Seon

Ean-Jeong Seo

Ki-Chang Seong

Bong-Seock Seo

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Seo, Paul Hongsuck"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Learning Correlation Structures for Vision Transformers.

Please choose a person to relate this publication to

Eunyoung Seo

Paek Pyung Seon

Ean-Jeong Seo

Ki-Chang Seong

Bong-Seock Seo

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Learning Correlation Structures for Vision Transformers.