Author of the publication

Identifying and attacking the saddle point problem in high-dimensional non-convex optimization

(2014). arXiv:1406.2572. Comment: The theoretical review and analysis in this article draw heavily from arXiv:1405.4604 [cs.LG].

Other publications of authors with the same name

Fundamental bounds on the fidelity of sensory cortical coding. Nature, 580 (7801): 100-105 (2020)

Short-term memory in neuronal networks through dynamical compressed sensing. NIPS, pages 667-675. Curran Associates, Inc. (2010)

Deep learning versus kernel learning: an empirical study of loss landscape geometry and the time evolution of the Neural Tangent Kernel. NeurIPS (2020)

Identifying Learning Rules From Neural Network Observables. NeurIPS (2020)

Pruning neural networks without any data by iteratively conserving synaptic flow. NeurIPS (2020)

Reverse engineering recurrent networks for sentiment classification reveals line attractor dynamics. NeurIPS, pages 15670-15679 (2019)

A theory of high dimensional regression with arbitrary correlations between input features and target functions: sample complexity, multiple descent curves and a hierarchy of phase transitions. ICML, volume 139 of Proceedings of Machine Learning Research, pages 7578-7587. PMLR (2021)

Emergent properties of the local geometry of neural loss landscapes. arXiv:1910.05929 (2019). Comment: 10 pages, 8 figures.

Get rich quick: exact solutions reveal how unbalanced initializations promote rapid feature learning. CoRR (2024)

Exact solutions to the nonlinear dynamics of learning in deep linear neural networks. ICLR (2014)