Author of the publication

Iterate averaging as regularization for stochastic gradient descent

(2018). arXiv:1802.08009.


Other publications of authors with the same name

Faster saddle-point optimization for solving large-scale Markov decision processes. L4DC, volume 120 of Proceedings of Machine Learning Research, pages 413-423. PMLR (2020).
Offline Primal-Dual Reinforcement Learning for Linear MDPs. CoRR (2023).
Convex Analytic Theory for Convex Q-Learning. CDC, pages 4065-4071. IEEE (2022).
Proximal Point Imitation Learning. NeurIPS (2022).
Apprenticeship Learning using Inverse Reinforcement Learning and Gradient Methods. UAI, pages 295-302. AUAI Press (2007).
Online combinatorial optimization with stochastic decision sets and adversarial losses. NIPS, pages 2780-2788 (2014).
Optimistic Planning by Regularized Dynamic Programming. ICML, volume 202 of Proceedings of Machine Learning Research, pages 25337-25357. PMLR (2023).
Logistic Q-Learning. AISTATS, volume 130 of Proceedings of Machine Learning Research, pages 3610-3618. PMLR (2021).
Nonstochastic Contextual Combinatorial Bandits. AISTATS, volume 206 of Proceedings of Machine Learning Research, pages 8771-8813. PMLR (2023).
Bandit Principal Component Analysis. COLT, volume 99 of Proceedings of Machine Learning Research, pages 1994-2024. PMLR (2019).