Inproceedings,

A Greedy Approach to Adapting the Trace Parameter for Temporal Difference Learning.

, and .
AAMAS, page 557-565. ACM, (2016)

Meta data

Tags

Users

  • @dblp

Comments and Reviews