Author of the publication

Scaling Local Self-Attention for Parameter Efficient Visual Backbones.

, , , , , and . CVPR, page 12894-12904. Computer Vision Foundation / IEEE, (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Mesh-tensorflow: Deep learning for supercomputers, , , , , , , , , and 1 other author(s). Advances in Neural Information Processing Systems, page 10435--10444. (2018)Beyond Parallel Data: Joint Word Alignment and Decipherment Improves Machine Translation., , and . EMNLP, page 557-565. ACL, (2014)Supertagging With LSTMs., , , and . HLT-NAACL, page 232-237. The Association for Computational Linguistics, (2016)Simple, Fast Noise-Contrastive Estimation for Large RNN Vocabularies., , , and . HLT-NAACL, page 1217-1222. The Association for Computational Linguistics, (2016)Music Transformer, , , , , , , , , and . (2018)cite arxiv:1809.04281Comment: Improved skewing section and accompanying figures. Previous titles are Än Improved Relative Self-Attention Mechanism for Transformer with Application to Music Generation" and "Music Transformer".Efficient Content-Based Sparse Attention with Routing Transformers., , , and . CoRR, (2020)Aligning context-based statistical models of language with brain activity during reading., , , and . EMNLP, page 233-243. ACL, (2014)Models and Training for Unsupervised Preposition Sense Disambiguation., , , , and . ACL (2), page 323-328. The Association for Computer Linguistics, (2011)Radiobot-CFF: a spoken dialogue system for military training., , , , , , and . INTERSPEECH, ISCA, (2006)Scaling Local Self-Attention for Parameter Efficient Visual Backbones., , , , , and . CVPR, page 12894-12904. Computer Vision Foundation / IEEE, (2021)