

Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments.

Y. Wang, S. Zhan, R. Jiao, Z. Wang, W. Jin, Z. Yang, Z. Wang, C. Huang, and Q. Zhu. ICML, volume 202 of Proceedings of Machine Learning Research, pp. 36593-36604. PMLR, (2023)

Yang Yang

Other publications by persons with the same name

One Objective to Rule Them All: A Maximization Objective Fusing Estimation and Planning for Exploration. Z. Liu, M. Lu, W. Xiong, H. Zhong, H. Hu, S. Zhang, S. Zheng, Z. Yang, and Z. Wang. CoRR, (2023)
Towards General Function Approximation in Zero-Sum Markov Games. B. Huang, J. Lee, Z. Wang, and Z. Yang. CoRR, (2021)
Misspecified nonconvex statistical optimization for sparse phase retrieval. Z. Yang, L. Yang, E. Fang, T. Zhao, Z. Wang, and M. Neykov. Math. Program., 176 (1-2): 545-571 (2019)
Generative Adversarial Imitation Learning with Neural Networks: Global Optimality and Convergence Rate. Y. Zhang, Q. Cai, Z. Yang, and Z. Wang. CoRR, (2020)
Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium. Q. Xie, Y. Chen, Z. Wang, and Z. Yang. Math. Oper. Res., 48 (1): 433-462 (February 2023)
Exponential Family Model-Based Reinforcement Learning via Score Matching. G. Li, J. Li, A. Kabra, N. Srebro, Z. Wang, and Z. Yang. NeurIPS, (2022)
Relational Reasoning via Set Transformers: Provable Efficiency and Applications to MARL. F. Zhang, B. Liu, K. Wang, V. Tan, Z. Yang, and Z. Wang. NeurIPS, (2022)
Provably Efficient Neural GTD for Off-Policy Learning. H. Wai, Z. Yang, Z. Wang, and M. Hong. NeurIPS, (2020)
Provably Efficient Neural Estimation of Structural Equation Models: An Adversarial Approach. L. Liao, Y. Chen, Z. Yang, B. Dai, M. Kolar, and Z. Wang. NeurIPS, (2020)
Upper Confidence Primal-Dual Reinforcement Learning for CMDP with Adversarial Loss. S. Qiu, X. Wei, Z. Yang, J. Ye, and Z. Wang. NeurIPS, (2020)
