Article,

Learning Stochastic Optimal Policies via Gradient Descent.

S. Massaroli, M. Poli, S. Peluchetti, J. Park, A. Yamashita, and H. Asama.
CoRR, (2021)

Meta data

BibTeX key: journals/corr/abs-2106-03780
entry type: article
year: 2021
journal: CoRR
volume: abs/2106.03780
ee: https://arxiv.org/abs/2106.03780
url: http://dblp.uni-trier.de/db/journals/corr/corr2106.html#abs-2106-03780

Tags

dblp

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

search on