From post

Enforcing Hard Constraints with Soft Barriers: Safe Reinforcement Learning in Unknown Stochastic Environments.

, , , , , , , , и . ICML, том 202 из Proceedings of Machine Learning Research, стр. 36593-36604. PMLR, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

One Objective to Rule Them All: A Maximization Objective Fusing Estimation and Planning for Exploration., , , , , , , , и . CoRR, (2023)Towards General Function Approximation in Zero-Sum Markov Games., , , и . CoRR, (2021)Misspecified nonconvex statistical optimization for sparse phase retrieval., , , , , и . Math. Program., 176 (1-2): 545-571 (2019)Generative Adversarial Imitation Learning with Neural Networks: Global Optimality and Convergence Rate., , , и . CoRR, (2020)Learning Zero-Sum Simultaneous-Move Markov Games Using Function Approximation and Correlated Equilibrium., , , и . Math. Oper. Res., 48 (1): 433-462 (февраля 2023)Exponential Family Model-Based Reinforcement Learning via Score Matching., , , , , и . NeurIPS, (2022)Relational Reasoning via Set Transformers: Provable Efficiency and Applications to MARL., , , , , и . NeurIPS, (2022)Provably Efficient Neural GTD for Off-Policy Learning., , , и . NeurIPS, (2020)Provably Efficient Neural Estimation of Structural Equation Models: An Adversarial Approach., , , , , и . NeurIPS, (2020)Upper Confidence Primal-Dual Reinforcement Learning for CMDP with Adversarial Loss., , , , и . NeurIPS, (2020)