Author of the publication

Cancellation-Free Regret Bounds for Lagrangian Approaches in Constrained Markov Decision Processes.

A. Müller, P. Alatur, G. Ramponi, and N. He. CoRR, (2023)

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to a name to display some publications already assigned to that person.

Ning He

Hongxia He

Chunmao He

Xiaowen He

Hucang He

Other publications of authors with the same name

Optimization for Reinforcement Learning: From Single Agent to Cooperative Agents. D. Lee, N. He, P. Kamalaruban, and V. Cevher. CoRR, (2019)

TiAda: A Time-scale Adaptive Algorithm for Nonconvex Minimax Optimization. X. Li, J. Yang, and N. He. CoRR, (2022)

Optimal Guarantees for Algorithmic Reproducibility and Gradient Complexity in Convex Optimization. L. Zhang, J. Yang, A. Karbasi, and N. He. CoRR, (2023)

On the Statistical Efficiency of Mean Field Reinforcement Learning with General Function Approximation. J. Huang, B. Yardim, and N. He. CoRR, (2023)

Provably Convergent Policy Optimization via Metric-aware Trust Region Methods. J. Song, N. He, L. Ding, and C. Zhao. CoRR, (2023)

Sample Complexity and Overparameterization Bounds for Temporal-Difference Learning With Neural Network Approximation. S. Cayci, S. Satpathi, N. He, and R. Srikant. IEEE Trans. Autom. Control., 68 (5): 2891-2905 (May 2023)

Scalable Bayesian Inference via Particle Mirror Descent. B. Dai, N. He, H. Dai, and L. Song. CoRR, (2015)

Simulation Studies on Deep Reinforcement Learning for Building Control with Human Interaction. D. Lee, N. He, S. Lee, P. Karava, and J. Hu. CoRR, (2021)

Optimization for Reinforcement Learning: From a single agent to cooperative agents. D. Lee, N. He, P. Kamalaruban, and V. Cevher. IEEE Signal Process. Mag., 37 (3): 123-135 (2020)

Reinforcement Learning with General Utilities: Simpler Variance Reduction and Large State-Action Space. A. Barakat, I. Fatkhullin, and N. He. ICML, volume 202 of Proceedings of Machine Learning Research, page 1753-1800. PMLR, (2023)

BibSonomy

Disambiguation of "He, Niao"
