Author of the publication

Cancellation-Free Regret Bounds for Lagrangian Approaches in Constrained Markov Decision Processes.

, , , and . CoRR, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Optimization for Reinforcement Learning: From Single Agent to Cooperative Agents., , , and . CoRR, (2019)TiAda: A Time-scale Adaptive Algorithm for Nonconvex Minimax Optimization., , and . CoRR, (2022)Optimal Guarantees for Algorithmic Reproducibility and Gradient Complexity in Convex Optimization., , , and . CoRR, (2023)On the Statistical Efficiency of Mean Field Reinforcement Learning with General Function Approximation., , and . CoRR, (2023)Provably Convergent Policy Optimization via Metric-aware Trust Region Methods., , , and . CoRR, (2023)Sample Complexity and Overparameterization Bounds for Temporal-Difference Learning With Neural Network Approximation., , , and . IEEE Trans. Autom. Control., 68 (5): 2891-2905 (May 2023)Scalable Bayesian Inference via Particle Mirror Descent., , , and . CoRR, (2015)Simulation Studies on Deep Reinforcement Learning for Building Control with Human Interaction., , , , and . CoRR, (2021)Optimization for Reinforcement Learning: From a single agent to cooperative agents., , , and . IEEE Signal Process. Mag., 37 (3): 123-135 (2020)Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space., , and . ICML, volume 202 of Proceedings of Machine Learning Research, page 1753-1800. PMLR, (2023)