Inproceedings,

Optimal Estimation of Policy Gradient via Double Fitted Iteration.

C. Ni, R. Zhang, X. Ji, X. Zhang, and M. Wang.
ICML, volume 162 of Proceedings of Machine Learning Research, page 16724-16783. PMLR, (2022)

Meta data

BibTeX key: conf/icml/NiZJZW22
entry type: inproceedings
booktitle: ICML
year: 2022
pages: 16724-16783
publisher: PMLR
series: Proceedings of Machine Learning Research
volume: 162
crossref: conf/icml/2022
ee: https://proceedings.mlr.press/v162/ni22b.html
url: http://dblp.uni-trier.de/db/conf/icml/icml2022.html#NiZJZW22

Tags

dblp

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

search on