From post

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

Disentangling Adaptive Gradient Methods from Learning Rates, , , , и . (2020)cite arxiv:2002.11803.Recurrent Convolutional Neural Networks Learn Succinct Learning Algorithms., , , и . NeurIPS, (2022)Learning Hidden Markov Models Using Conditional Samples., , , и . COLT, том 195 из Proceedings of Machine Learning Research, стр. 2014-2066. PMLR, (2023)Inductive Biases and Variable Creation in Self-Attention Mechanisms., , , и . ICML, том 162 из Proceedings of Machine Learning Research, стр. 5793-5831. PMLR, (2022)Sparsity in Partially Controllable Linear Systems., , , и . ICML, том 162 из Proceedings of Machine Learning Research, стр. 5851-5860. PMLR, (2022)Can large language models explore in-context?, , , , и . CoRR, (2024)Extreme Tensoring for Low-Memory Preconditioning., , , , и . CoRR, (2019)Extreme Tensoring for Low-Memory Preconditioning., , , , и . ICLR, OpenReview.net, (2020)Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone., , , , , , , , , и 77 other автор(ы). CoRR, (2024)Acceleration via Fractal Learning Rate Schedules., , и . ICML, том 139 из Proceedings of Machine Learning Research, стр. 87-99. PMLR, (2021)