Inproceedings,

A Dynamic and Task-Independent Reward Shaping Approach for Discrete Partially Observable Markov Decision Processes.

PAKDD (2), volume 13936 of Lecture Notes in Computer Science, pages 337-348. Springer (2023)
