From post

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

Performance Bounds and Suboptimal Policies for Multi-Period Investment., , , и . Found. Trends Optim., 1 (1): 1-72 (2014)Discovering Diverse Nearly Optimal Policies withSuccessor Features., , , , , и . CoRR, (2021)Conic Optimization via Operator Splitting and Homogeneous Self-Dual Embedding., , , и . J. Optimization Theory and Applications, 169 (3): 1042-1068 (2016)ReLOAD: Reinforcement Learning with Optimistic Ascent-Descent for Last-Iterate Convergence in Constrained MDPs., , , , , и . ICML, том 202 из Proceedings of Machine Learning Research, стр. 25303-25336. PMLR, (2023)Iterated approximate value functions., , и . ECC, стр. 3882-3888. IEEE, (2013)Combining policy gradient and Q-learning., , , и . ICLR (Poster), OpenReview.net, (2017)The Uncertainty Bellman Equation and Exploration., , , и . CoRR, (2017)Discovering a set of policies for the worst case reward., , , , , , и . ICLR, OpenReview.net, (2021)Matrix games with bandit feedback., , и . UAI, том 161 из Proceedings of Machine Learning Research, стр. 279-289. AUAI Press, (2021)Sample Efficient Reinforcement Learning with REINFORCE., , , и . AAAI, стр. 10887-10895. AAAI Press, (2021)