Author of the publication

Dissecting the Effects of SGD Noise in Distinct Regimes of Deep Learning.

, , and . ICML, volume 202 of Proceedings of Machine Learning Research, page 30381-30405. PMLR, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Relation between bid--ask spread, impact and volatility in order-driven markets, , , , and . Quantitative Finance, 8 (1): 41--57 (2008)A jamming transition from under- to over-parametrization affects loss landscape and generalization., , , , , and . CoRR, (2018)Perspective: A Phase Diagram for Deep Learning unifying Jamming, Feature Learning and Lazy Training., , and . CoRR, (2020)Comparing Dynamics: Deep Neural Networks versus Glassy Systems., , , , , , , , and . ICML, volume 80 of Proceedings of Machine Learning Research, page 324-333. PMLR, (2018)Asymptotic learning curves of kernel methods: empirical data v.s. Teacher-Student paradigm., , and . CoRR, (2019)The jamming transition as a paradigm to understand the loss landscape of deep neural networks, , , , , , and . (2018)cite arxiv:1809.09349.How memory architecture affects performance and learning in simple POMDPs., , and . CoRR, (2021)Learning sparse features can lead to overfitting in neural networks., , , and . NeurIPS, (2022)Locality defeats the curse of dimensionality in convolutional teacher-student scenarios., , and . NeurIPS, page 9456-9467. (2021)Failure and success of the spectral bias prediction for Laplace Kernel Ridge Regression: the case of low-dimensional data., , and . ICML, volume 162 of Proceedings of Machine Learning Research, page 21548-21583. PMLR, (2022)