From post

копировать удалить добавить публикацию в буфер
Запись сообщества
посмотреть историю данной записи
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale

A. Dosovitskiy, L. Beyer, A. Kolesnikov, D. Weissenborn, X. Zhai, T. Unterthiner, M. Dehghani, M. Minderer, G. Heigold, S. Gelly, J. Uszkoreit, и N. Houlsby. (2020)cite arxiv:2010.11929Comment: Fine-tuning code and pre-trained models are available at https://github.com/google-research/vision_transformer. ICLR camera-ready version with 2 small modifications: 1) Added a discussion of CLS vs GAP classifier in the appendix, 2) Fixed an error in exaFLOPs computation in Figure 5 and Table 6 (relative performance of models is basically not affected).

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

Georg Heigold

Stefanie Heigold

David Georg

Jens Georg

Inna Georgieva

Другие публикации лиц с тем же именем

Conditional Object-Centric Learning from Video.T. Kipf, G. Elsayed, A. Mahendran, A. Stone, S. Sabour, G. Heigold, R. Jonschkowski, A. Dosovitskiy, и K. Greff. CoRR, (2021)Conditional Object-Centric Learning from Video.T. Kipf, G. Elsayed, A. Mahendran, A. Stone, S. Sabour, G. Heigold, R. Jonschkowski, A. Dosovitskiy, и K. Greff. ICLR, OpenReview.net, (2022)WFST Enabled Solutions to ASR Problems: Beyond HMM Decoding.B. Hoffmeister, G. Heigold, D. Rybach, R. Schlüter, и H. Ney. IEEE Trans. Speech Audio Process., 20 (2): 551-564 (2012)A Linguistic Evaluation of Rule-Based, Phrase-Based, and Neural MT Engines.A. Burchardt, V. Macketanz, J. Dehdari, G. Heigold, J. Peter, и P. Williams. Prague Bull. Math. Linguistics, (2017)Video OWL-ViT: Temporally-consistent open-world localization in video.G. Heigold, D. Keysers, M. Minderer, M. Lucic, A. Gritsenko, F. Yu, A. Bewley, и T. Kipf. ICCV, стр. 13756-13765. IEEE, (2023)Optimization Algorithms and Applications for Speech and Language Processing.S. Wright, D. Kanevsky, L. Deng, X. He, G. Heigold, и H. Li. IEEE Trans. Speech Audio Process., 21 (11): 2231-2243 (2013)Investigations on an EM-Style Optimization Algorithm for Discriminative Training of HMMs.G. Heigold, H. Ney, и R. Schlüter. IEEE ACM Trans. Audio Speech Lang. Process., 21 (12): 2616-2626 (2013)Object-Centric Learning with Slot Attention.F. Locatello, D. Weissenborn, T. Unterthiner, A. Mahendran, G. Heigold, J. Uszkoreit, A. Dosovitskiy, и T. Kipf. NeurIPS, (2020)ViViT: A Video Vision Transformer.A. Arnab, M. Dehghani, G. Heigold, C. Sun, M. Lucic, и C. Schmid. ICCV, стр. 6816-6826. IEEE, (2021)Equivalence of Generative and Log-Linear Models.G. Heigold, H. Ney, P. Lehnen, T. Gass, и R. Schlüter. IEEE Trans. Speech Audio Process., 19 (5): 1138-1148 (2011)

BibSonomy

Disambiguation