From post

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

Llemma: An Open Language Model For Mathematics., , , , , , , , и . CoRR, (2023)MANSA: Learning Fast and Slow in Multi-Agent Systems., , , , , , , , , и . CoRR, (2023)Solving the Rubik's Cube Without Human Knowledge, , , и . (2018)cite arxiv:1805.07470Comment: First three authors contributed equally. Submitted to NIPS 2018.Faster Game Solving via Hyperparameter Schedules., , и . CoRR, (2024)Grasper: A Generalist Pursuer for Pursuit-Evasion Problems., , , , , , , и . AAMAS, стр. 1147-1155. International Foundation for Autonomous Agents and Multiagent Systems / ACM, (2024)Sequential Decision Making in Single-Agent and Multi-Agent Domains. University of California, Irvine, USA, (2022)Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning., , , , , , , , , и 24 other автор(ы). CoRR, (2022)Solving the Rubik's Cube with Approximate Policy Iteration., , , и . ICLR (Poster), OpenReview.net, (2019)Self-Play PSRO: Toward Optimal Populations in Two-Player Zero-Sum Games., , , , , и . CoRR, (2022)Confronting Reward Model Overoptimization with Constrained RLHF., , , , , , и . CoRR, (2023)