Author of the publication

The Power of Exploiter: Provable Multi-Agent RL in Large State Spaces.

, , and . ICML, volume 162 of Proceedings of Machine Learning Research, page 10251-10279. PMLR, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

A Tight Lower Bound for Uniformly Stable Algorithms., and . CoRR, (2020)The Power of Exploiter: Provable Multi-Agent RL in Large State Spaces., , and . CoRR, (2021)A Joint Allocation Algorithm of Computing and Communication Resources Based on Reinforcement Learning in MEC System., and . J. Inf. Process. Syst., 17 (4): 721-736 (2021)Evaluation of Water Quality Using Grey Clustering., and . WKDD, page 803-805. IEEE Computer Society, (2009)Pre-layout wire length and congestion estimation., and . DAC, page 582-587. ACM, (2004)Multi-attribute aware multipath data scheduling strategy for efficient MPTCP-based data delivery., , , , and . APCC, page 248-253. IEEE, (2016)Towards Agile Operation for Small Teams in Knowledge Intensive Organizations: A Collaboration Framework., , and . PRO-VE, volume 598 of IFIP Advances in Information and Communication Technology, page 263-272. Springer, (2020)Optimistic MLE: A Generic Model-Based Algorithm for Partially Observable Sequential Decision Making., , , and . STOC, page 363-376. ACM, (2023)A congestion-driven placement framework with local congestion prediction., and . ACM Great Lakes Symposium on VLSI, page 488-493. ACM, (2005)Sample-Efficient Reinforcement Learning of Partially Observable Markov Games., , and . NeurIPS, (2022)