Sample-Efficient Reinforcement Learning Based on Dynamics Models via Meta-policy Optimization.

ICCSIP, volume 1515 of Communications in Computer and Information Science, pages 360-373. Springer, (2021)
