Author of the publication

Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium.

Q. Xie, Y. Chen, Z. Wang, and Z. Yang. Math. Oper. Res., 48 (1): 433-462 (February 2023)

Yang Yang

Other publications of authors with the same name

One Objective to Rule Them All: A Maximization Objective Fusing Estimation and Planning for Exploration. Z. Liu, M. Lu, W. Xiong, H. Zhong, H. Hu, S. Zhang, S. Zheng, Z. Yang, and Z. Wang. CoRR, (2023)
Towards General Function Approximation in Zero-Sum Markov Games. B. Huang, J. Lee, Z. Wang, and Z. Yang. CoRR, (2021)
Generative Adversarial Imitation Learning with Neural Networks: Global Optimality and Convergence Rate. Y. Zhang, Q. Cai, Z. Yang, and Z. Wang. CoRR, (2020)
Misspecified nonconvex statistical optimization for sparse phase retrieval. Z. Yang, L. Yang, E. Fang, T. Zhao, Z. Wang, and M. Neykov. Math. Program., 176 (1-2): 545-571 (2019)
Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium. Q. Xie, Y. Chen, Z. Wang, and Z. Yang. Math. Oper. Res., 48 (1): 433-462 (February 2023)
Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments. Y. Wang, S. Zhan, R. Jiao, Z. Wang, W. Jin, Z. Yang, Z. Wang, C. Huang, and Q. Zhu. ICML, volume 202 of Proceedings of Machine Learning Research, pages 36593-36604. PMLR, (2023)
Doubly Robust Off-Policy Actor-Critic: Convergence and Optimality. T. Xu, Z. Yang, Z. Wang, and Y. Liang. ICML, volume 139 of Proceedings of Machine Learning Research, pages 11581-11591. PMLR, (2021)
On Reward-Free RL with Kernel and Neural Function Approximations: Single-Agent MDP and Markov Game. S. Qiu, J. Ye, Z. Wang, and Z. Yang. ICML, volume 139 of Proceedings of Machine Learning Research, pages 8737-8747. PMLR, (2021)
Learning While Playing in Mean-Field Games: Convergence and Optimality. Q. Xie, Z. Yang, Z. Wang, and A. Minca. ICML, volume 139 of Proceedings of Machine Learning Research, pages 11436-11447. PMLR, (2021)
Provably Efficient Neural GTD for Off-Policy Learning. H. Wai, Z. Yang, Z. Wang, and M. Hong. NeurIPS, (2020)

BibSonomy

Disambiguation of "Yang, Zhuoran"
