Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Pretraining Without Attention., , , and . EMNLP (Findings), page 58-69. Association for Computational Linguistics, (2023)An Empirical Study of Mamba-based Language Models., , , , , , , , , and 6 other author(s). CoRR, (2024)How to Train your HIPPO: State Space Models with Generalized Orthogonal Basis Projections., , , , and . ICLR, OpenReview.net, (2023)Caduceus: Bi-Directional Equivariant Long-Range DNA Sequence Modeling., , , , , and . ICML, OpenReview.net, (2024)Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models., , , , , , , , , and 7 other author(s). CoRR, (2024)Learning Fast Algorithms for Linear Transforms Using Butterfly Factorizations., , , , and . ICML, volume 97 of Proceedings of Machine Learning Research, page 1517-1527. PMLR, (2019)Learning Mixed-Curvature Representations in Product Spaces., , , and . ICLR (Poster), OpenReview.net, (2019)S4ND: Modeling Images and Videos as Multidimensional Signals with State Spaces., , , , , , , and . NeurIPS, (2022)HoroPCA: Hyperbolic Dimensionality Reduction via Horospherical Projections., , , and . ICML, volume 139 of Proceedings of Machine Learning Research, page 1419-1429. PMLR, (2021)Modelling Long Range Dependencies in $N$D: From Task-Specific to a General Purpose CNN., , , , , , , and . ICLR, OpenReview.net, (2023)