Inproceedings,

Off-Policy Policy Gradient with Stationary Distribution Correction.

, , , and .
UAI, volume 115 of Proceedings of Machine Learning Research, page 1180-1190. AUAI Press, (2019)

Meta data

Tags

Users

  • @dblp

Comments and Reviews