@dblp

Off-Policy Policy Gradient with Stationary Distribution Correction.

, , , and . UAI, volume 115 of Proceedings of Machine Learning Research, page 1180-1190. AUAI Press, (2019)

Links and resources

Tags