Author of the publication

Which transformer architecture fits my data? A vocabulary bottleneck in self-attention.

, , , and . ICML, volume 139 of Proceedings of Machine Learning Research, page 11170-11181. PMLR, (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Deep Learning and Quantum Entanglement: Fundamental Connections with Implications to Network Design., , , and . CoRR, (2017)Revisiting Single-View Shape Tensors: Theory and Applications., and . ECCV (2), volume 2351 of Lecture Notes in Computer Science, page 399-414. Springer, (2002)Threading Fundamental Matrices., and . ECCV (1), volume 1406 of Lecture Notes in Computer Science, page 124-140. Springer, (1998)Stereo-Assist: Top-down stereo for driver assistance systems., , and . Intelligent Vehicles Symposium, page 723-730. IEEE, (2010)Trajectory Triangulation of Lines: Reconstruction of a 3D point Moving along a Line from a Monocular Image Sequence., and . CVPR, page 2062-2066. IEEE Computer Society, (1999)Q-Warping: Direct Computation of Quadratic Reference Surfaces., and . CVPR, page 1333-1338. IEEE Computer Society, (1999)Model-based brightness constraints: on direct estimation of structure and motion., and . CVPR, page 400-406. IEEE Computer Society, (1997)Trajectory Triangulation over Conic Sections., , and . ICCV, page 330-336. IEEE Computer Society, (1999)Ambiguity in Reconstruction from Images of Six Points., and . ICCV, page 703-708. IEEE Computer Society, (1998)Multi-Frame Infinitesimal Motion Model for the Reconstruction of (Dynamic) Scenes with Multiple Linearly Moving Objects., and . ICCV, page 592-599. IEEE Computer Society, (2001)