Author of the publication

From block-Toeplitz matrices to differential equations on graphs: towards a general theory for scalable masked Transformers.

, , , , , , , , , and . ICML, volume 162 of Proceedings of Machine Learning Research, page 3962-3983. PMLR, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Time Dependence in Non-Autonomous Neural ODEs., , , , , , , , and . CoRR, (2020)Binary embeddings with structured hashed projections., , , , , and . ICML, volume 48 of JMLR Workshop and Conference Proceedings, page 344-353. JMLR.org, (2016)Explaining How a Deep Neural Network Trained with End-to-End Learning Steers a Car., , , , , , and . CoRR, (2017)Fast Online Clustering with Randomized Skeleton Sets., , and . CoRR, (2015)The Geometry of Random Features., , , , , and . AISTATS, volume 84 of Proceedings of Machine Learning Research, page 1-9. PMLR, (2018)Online Hyper-parameter Tuning in Off-policy Learning via Evolutionary Strategies., and . CoRR, (2020)Hybrid Random Features., , , , , , , , , and 3 other author(s). CoRR, (2021)Masked Language Modeling for Proteins via Linearly Scalable Long-Context Transformers., , , , , , , , and . CoRR, (2020)SARA-RT: Scaling up Robotics Transformers with Self-Adaptive Robust Attention., , , , , , , , , and 4 other author(s). CoRR, (2023)Quasi-Monte Carlo Graph Random Features., , and . CoRR, (2023)