Author of the publication

Posterior sampling for multi-agent reinforcement learning: solving extensive games with imperfect information.

, , and . ICLR, OpenReview.net, (2020)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Lazy-CFR: fast and near-optimal regret minimization for extensive games with imperfect information., , , , and . ICLR, OpenReview.net, (2020)Selective Verification Strategy for Learning From Crowds., , and . AAAI, page 4147-4154. AAAI Press, (2018)Online Label Aggregation: A Variational Bayesian Approach., , , , and . WWW, page 1904-1915. ACM / IW3C2, (2021)Identify the Nash Equilibrium in Static Games with Random Payoffs., , and . ICML, volume 70 of Proceedings of Machine Learning Research, page 4160-4169. PMLR, (2017)Stabilizing Policy Gradients for Stochastic Differential Equations via Consistency with Perturbation Process., , and . CoRR, (2024)Simultaneously Learning Stochastic and Adversarial Bandits with General Graph Feedback., , and . ICML, volume 162 of Proceedings of Machine Learning Research, page 11473-11482. PMLR, (2022)Exploration Analysis in Finite-Horizon Turn-based Stochastic Games., , , and . UAI, volume 124 of Proceedings of Machine Learning Research, page 201-210. AUAI Press, (2020)Lazy-CFR: a fast regret minimization algorithm for extensive games with imperfect information., , , , and . CoRR, (2018)Racing Thompson: an Efficient Algorithm for Thompson Sampling with Non-conjugate Priors., , and . CoRR, (2017)Racing Thompson: an Efficient Algorithm for Thompson Sampling with Non-conjugate Priors., , and . ICML, volume 80 of Proceedings of Machine Learning Research, page 5995-6003. PMLR, (2018)