Inproceedings,

Optimality and Approximation with Policy Gradient Methods in Markov Decision Processes.

A. Agarwal, S. Kakade, J. Lee, and G. Mahajan.
COLT, volume 125 of Proceedings of Machine Learning Research, page 64-66. PMLR, (2020)

Meta data

BibTeX key: conf/colt/AgarwalKLM20
entry type: inproceedings
booktitle: COLT
year: 2020
pages: 64-66
publisher: PMLR
series: Proceedings of Machine Learning Research
volume: 125
crossref: conf/colt/2020
ee: http://proceedings.mlr.press/v125/agarwal20a.html
url: http://dblp.uni-trier.de/db/conf/colt/colt2020.html#AgarwalKLM20

Tags

dblp

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

search on