Inproceedings,

V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control.

H. Song, A. Abdolmaleki, J. Springenberg, A. Clark, H. Soyer, J. Rae, S. Noury, A. Ahuja, S. Liu, D. Tirumala, N. Heess, D. Belov, M. Riedmiller, and M. Botvinick.
ICLR, OpenReview.net, (2020)

Meta data

BibTeX key: conf/iclr/SongASCSRNALTHB20
entry type: inproceedings
booktitle: ICLR
year: 2020
publisher: OpenReview.net
crossref: conf/iclr/2020
ee: https://openreview.net/forum?id=SylOlp4FvH
url: http://dblp.uni-trier.de/db/conf/iclr/iclr2020.html#SongASCSRNALTHB20

Tags

dblp

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

search on