From post

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed.

 

Other publications of persons with the same name

Policy gradient approaches for multi-objective sequential decision making., , , , and . IJCNN, pp. 2323-2330. IEEE, (2014)

Piecewise constant reinforcement learning for robotic applications., , and . ICINCO-ICSO, pp. 214-221. INSTICC Press, (2007) 978-972-8865-82-5.

Equilibrium approximation in simulation-based extensive-form games., and . AAMAS, pp. 199-206. IFAAMAS, (2011)

Bifurcation Analysis of Reinforcement Learning Agents in the Selten's Horse Game., , , and . Adaptive Agents and Multi-Agents Systems, volume 4865 of Lecture Notes in Computer Science, pp. 129-144. Springer, (2007)

Best Arm Identification for Stochastic Rising Bandits., , , , and . CoRR, (2023)

Simultaneously Updating All Persistence Values in Reinforcement Learning., , , , and . AAAI, pp. 9668-9676. AAAI Press, (2023)

Policy Optimization as Online Learning with Mediator Feedback., , , and . AAAI, pp. 8958-8966. AAAI Press, (2021)

Lifelong Hyper-Policy Optimization with Multiple Importance Sampling Regularization., , , and . AAAI, pp. 7525-7533. AAAI Press, (2022)

Unsupervised Reinforcement Learning in Multiple Environments., , and . AAAI, pp. 7850-7858. AAAI Press, (2022)

An Asymptotically Optimal Primal-Dual Incremental Algorithm for Contextual Linear Bandits., , , and . NeurIPS, (2020)