Inproceedings,

Optimality and Approximation with Policy Gradient Methods in Markov Decision Processes.

, , , and .
COLT, volume 125 of Proceedings of Machine Learning Research, page 64-66. PMLR, (2020)

Meta data

Tags

Users

  • @dblp

Comments and Reviews