Author of the publication

Linear Convergence of Adaptive Stochastic Gradient Descent.

, , and . AISTATS, volume 108 of Proceedings of Machine Learning Research, page 1475-1485. PMLR, (2020)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

WNGrad: Learn the Learning Rate in Gradient Descent, , and . (2018)cite arxiv:1803.02865Comment: 10 pages, 3 figures, conference.Research on the Development Trend and Coping Strategies of Internet Finance., and . CSIA, volume 928 of Advances in Intelligent Systems and Computing, page 543-549. Springer, (2019)Understanding Int4 Quantization for Language Models: Latency Speedup, Composability, and Failure Cases., , , , and . ICML, volume 202 of Proceedings of Machine Learning Research, page 37524-37539. PMLR, (2023)Guaranteeing performance yield in high-level synthesis., , and . ICCAD, page 303-309. ACM, (2006)Estimating the Proportion of True Null Hypotheses in Nonparametric Exponential Mixture Model with Appication to the Leukemia Gene Expression Data., , , and . Communications in Statistics - Simulation and Computation, 41 (9): 1580-1592 (2012)Analysis of Subthreshold Finfet Circuits for Ultra-Low Power Design., , and . SoCC, page 91-92. IEEE, (2006)All-optical time domain 160 Gb/s ADD/DROP based on pump depletion and nonlinearities in a single PPLN waveguide, , , and . Optical Fiber Communication - incudes post deadline papers, 2009. OFC 2009. Conference on, page 1-3--. (2009)Methionine-Capped Gold Nanoclusters as a Fluorescence-Enhanced Probe for Cadmium(II) Sensing., , , , and . Sensors, 18 (2): 658 (2018)Design exploration of hybrid caches with disparate memory technologies., , , , , and . ACM Trans. Archit. Code Optim., 7 (3): 15:1-15:34 (2010)Linear Convergence of Adaptive Stochastic Gradient Descent., , and . CoRR, (2019)