From post

The Evolutionary Dynamics of Soft-Max Policy Gradient in Multi-Agent Settings.

, , , , и . AAMAS, стр. 1545-1547. International Foundation for Autonomous Agents and Multiagent Systems (IFAAMAS), (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

Sequential Information Design: Learning to Persuade in the Dark., , , , и . NeurIPS, (2022)Dark-Pool Smart Order Routing: a Combinatorial Multi-armed Bandit Approach., , , , и . ICAIF, стр. 352-360. ACM, (2022)No-Regret Learning in Bilateral Trade via Global Budget Balance., , , и . STOC, стр. 247-258. ACM, (2024)Constrained Phi-Equilibria., , , , и . ICML, том 202 из Proceedings of Machine Learning Research, стр. 2184-2205. PMLR, (2023)Learning Extensive-Form Perfect Equilibria in Two-Player Zero-Sum Sequential Games., , и . AISTATS, том 238 из Proceedings of Machine Learning Research, стр. 2152-2160. PMLR, (2024)Beyond Primal-Dual Methods in Bandits with Stochastic and Adversarial Constraints., , , и . CoRR, (2024)Last-iterate Convergence to Trembling-hand Perfect Equilibria., , и . CoRR, (2022)Safe Learning in Tree-Form Sequential Decision Making: Handling Hard and Soft Constraints., , , , , и . ICML, том 162 из Proceedings of Machine Learning Research, стр. 1854-1873. PMLR, (2022)Optimal Rates and Efficient Algorithms for Online Bayesian Persuasion., , , , , и . ICML, том 202 из Proceedings of Machine Learning Research, стр. 2164-2183. PMLR, (2023)No-Regret Learning in Bilateral Trade via Global Budget Balance., , , и . CoRR, (2023)