From post

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

Optimizing over a Restricted Policy Class in MDPs., , , и . AISTATS, том 89 из Proceedings of Machine Learning Research, стр. 3042-3050. PMLR, (2019)Optimizing over a Restricted Policy Class in Markov Decision Processes., , , и . CoRR, (2018)Bayesian Policy Gradient and Actor-Critic Algorithms., , и . J. Mach. Learn. Res., (2016)A multiagent reinforcement learning algorithm by dynamically merging markov decision processes., и . AAMAS, стр. 845-846. ACM, (2002)Upper-Confidence-Bound Algorithms for Active Learning in Multi-armed Bandits., , , , и . ALT, том 6925 из Lecture Notes in Computer Science, стр. 189-203. Springer, (2011)Regularized Policy Iteration., , , и . NIPS, стр. 441-448. Curran Associates, Inc., (2008)Actor-Critic Algorithms for Risk-Sensitive MDPs., и . NIPS, стр. 252-260. (2013)Efficient Risk-Averse Reinforcement Learning., , , и . NeurIPS, (2022)Safe Policy Improvement by Minimizing Robust Baseline Regret., , и . NIPS, стр. 2298-2306. (2016)Feature and Parameter Selection in Stochastic Linear Bandits., , , , и . ICML, том 162 из Proceedings of Machine Learning Research, стр. 15927-15958. PMLR, (2022)