Author of the publication

The Implicit Bias of Gradient Descent on Separable Data

arXiv:1710.10345 (2017). Comment: Final JMLR version, with improved discussions over v3. Main improvements in the journal version over the conference version (v2, which appeared in ICLR): we proved the measure-zero case for the main theorem (with implications for the rates), and the multi-class case.


Other publications of authors with the same name

Regularization Guarantees Generalization in Bayesian Reinforcement Learning through Algorithmic Stability. AAAI, pages 8423-8431. AAAI Press, (2022)
Exponentially vanishing sub-optimal local minima in multilayer neural networks. ICLR (Workshop), OpenReview.net, (2018)
Physics-Aware Downsampling with Deep Learning for Scalable Flood Modeling. NeurIPS, pages 1378-1389. (2021)
Accurate Post Training Quantization With Small Calibration Sets. ICML, volume 139 of Proceedings of Machine Learning Research, pages 4466-4475. PMLR, (2021)
The Implicit Bias of Gradient Descent on Separable Data. J. Mach. Learn. Res., (2018)
Scaling FP8 training to trillion-token LLMs. CoRR, (2024)
Binarized neural networks: Training deep neural networks with weights and activations constrained to +1 or -1. arXiv preprint arXiv:1602.02830, (2016)
How do infinite width bounded norm networks look in function space? COLT, volume 99 of Proceedings of Machine Learning Research, pages 2667-2690. PMLR, (2019)
Memristor-Based Multilayer Neural Networks With Online Gradient Descent Training. IEEE Trans. Neural Networks Learn. Syst., 26 (10): 2408-2421 (2015)
Beyond Signal Propagation: Is Feature Diversity Necessary in Deep Neural Network Initialization? ICML, volume 119 of Proceedings of Machine Learning Research, pages 960-969. PMLR, (2020)