From post

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed.

 

Other publications by persons with the same name

Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic. ICLR, OpenReview.net (2017)
Composable Action-Conditioned Predictors: Flexible Off-Policy Learning for Robot Navigation. CoRL, volume 87 of Proceedings of Machine Learning Research, pp. 806-816. PMLR (2018)
Continuous Inverse Optimal Control with Locally Optimal Examples. ICML, icml.cc / Omnipress (2012)
Residual Reinforcement Learning for Robot Control. CoRR (2018)
Learning Image-Conditioned Dynamics Models for Control of Underactuated Legged Millirobots. IROS, pp. 4606-4613. IEEE (2018)
Learning Human Objectives by Evaluating Hypothetical Behavior. CoRR (2019)
Model Inversion Networks for Model-Based Optimization. CoRR (2019)
Offline Reinforcement Learning: Tutorial, Review, and Perspectives on Open Problems. arXiv:2005.01643 (2020)
Optimism-driven exploration for nonlinear systems. ICRA, pp. 3239-3246. IEEE (2015)
Adaptive Risk Minimization: A Meta-Learning Approach for Tackling Group Shift. CoRR (2020)