From post

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

Scaling laws for single-agent reinforcement learning., , и . CoRR, (2023)Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations., , , , , и . CoRR, (2017)Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations., , , , , , и . Robotics: Science and Systems, (2018)Training language models to follow instructions with human feedback., , , , , , , , , и 10 other автор(ы). NeurIPS, (2022)OpenAI Gym, , , , , , и . (2016)cite arxiv:1606.01540.Let's Verify Step by Step., , , , , , , , , и . ICLR, OpenReview.net, (2024)Model-Based Reinforcement Learning via Meta-Policy Optimization., , , , , и . CoRL, том 87 из Proceedings of Machine Learning Research, стр. 617-629. PMLR, (2018)Policy Gradient Search: Online Planning and Expert Iteration without Search Trees., , , , и . CoRR, (2019)Distribution Augmentation for Generative Modeling., , , , , , и . ICML, том 119 из Proceedings of Machine Learning Research, стр. 5006-5019. PMLR, (2020)Phasic Policy Gradient., , , и . ICML, том 139 из Proceedings of Machine Learning Research, стр. 2020-2027. PMLR, (2021)