Author of the publication

Towards Understanding Ensemble, Knowledge Distillation and Self-Distillation in Deep Learning.

, and . ICLR, OpenReview.net, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Katyusha X: Practical Momentum Method for Stochastic Sum-of-Nonconvex Optimization.. ICML, volume 80 of Proceedings of Machine Learning Research, page 179-185. PMLR, (2018)Knightian self uncertainty in the vcg mechanism for unrestricted combinatorial auctions., , and . EC, page 619-620. ACM, (2014)A Local Algorithm for Finding Well-Connected Clusters., , and . ICML (3), volume 28 of JMLR Workshop and Conference Proceedings, page 396-404. JMLR.org, (2013)Backward Feature Correction: How Deep Learning Performs Deep Learning, and . (2020)cite arxiv:2001.04413.UniVR: A Universal Variance Reduction Framework for Proximal Stochastic Gradient Method., and . CoRR, (2015)Near-Optimal Design of Experiments via Regret Minimization., , , and . ICML, volume 70 of Proceedings of Machine Learning Research, page 126-135. PMLR, (2017)Feature Purification: How Adversarial Training Performs Robust Deep Learning., and . FOCS, page 977-988. IEEE, (2021)Even Faster Accelerated Coordinate Descent Using Non-Uniform Sampling., , , and . ICML, volume 48 of JMLR Workshop and Conference Proceedings, page 1110-1119. JMLR.org, (2016)A Convergence Theory for Deep Learning via Over-Parameterization., , and . CoRR, (2018)Towards Understanding Ensemble, Knowledge Distillation and Self-Distillation in Deep Learning., and . CoRR, (2020)