Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Towards Understanding Ensemble, Knowledge Distillation and Self-Distillation in Deep Learning.

Z. Allen-Zhu, and Y. Li. ICLR, OpenReview.net, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Katrin Allen

James Allen

Allen Fear

Allen Ehrlicher

Rachel Allen

Other publications of authors with the same name

Katyusha X: Practical Momentum Method for Stochastic Sum-of-Nonconvex Optimization.Z. Allen-Zhu. ICML, volume 80 of Proceedings of Machine Learning Research, page 179-185. PMLR, (2018)Knightian self uncertainty in the vcg mechanism for unrestricted combinatorial auctions.A. Chiesa, S. Micali, and Z. Zhu. EC, page 619-620. ACM, (2014)A Local Algorithm for Finding Well-Connected Clusters.Z. Zhu, S. Lattanzi, and V. Mirrokni. ICML (3), volume 28 of JMLR Workshop and Conference Proceedings, page 396-404. JMLR.org, (2013)Backward Feature Correction: How Deep Learning Performs Deep LearningZ. Allen-Zhu, and Y. Li. (2020)cite arxiv:2001.04413.UniVR: A Universal Variance Reduction Framework for Proximal Stochastic Gradient Method.Z. Zhu, and Y. Yuan. CoRR, (2015)Near-Optimal Design of Experiments via Regret Minimization.Z. Allen-Zhu, Y. Li, A. Singh, and Y. Wang. ICML, volume 70 of Proceedings of Machine Learning Research, page 126-135. PMLR, (2017)Feature Purification: How Adversarial Training Performs Robust Deep Learning.Z. Allen-Zhu, and Y. Li. FOCS, page 977-988. IEEE, (2021)Even Faster Accelerated Coordinate Descent Using Non-Uniform Sampling.Z. Zhu, Z. Qu, P. Richtárik, and Y. Yuan. ICML, volume 48 of JMLR Workshop and Conference Proceedings, page 1110-1119. JMLR.org, (2016)A Convergence Theory for Deep Learning via Over-Parameterization.Z. Allen-Zhu, Y. Li, and Z. Song. CoRR, (2018)Towards Understanding Ensemble, Knowledge Distillation and Self-Distillation in Deep Learning.Z. Allen-Zhu, and Y. Li. CoRR, (2020)

BibSonomy

Disambiguation of "Allen-Zhu, Zeyuan"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Towards Understanding Ensemble, Knowledge Distillation and Self-Distillation in Deep Learning.

Please choose a person to relate this publication to

Katrin Allen

James Allen

Allen Fear

Allen Ehrlicher

Rachel Allen

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Allen-Zhu, Zeyuan"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Towards Understanding Ensemble, Knowledge Distillation and Self-Distillation in Deep Learning.

Please choose a person to relate this publication to

Katrin Allen

James Allen

Allen Fear

Allen Ehrlicher

Rachel Allen

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Towards Understanding Ensemble, Knowledge Distillation and Self-Distillation in Deep Learning.