Inproceedings,

A Dynamic and Task-Independent Reward Shaping Approach for Discrete Partially Observable Markov Decision Processes.

S. Nahali, H. Ayadi, J. Huang, E. Pakizeh, M. Pedram, and L. Safari.
PAKDD (2), volume 13936 of Lecture Notes in Computer Science, page 337-348. Springer, (2023)

Meta data

BibTeX key: conf/pakdd/NahaliAHPPS23
entry type: inproceedings
booktitle: PAKDD (2)
year: 2023
pages: 337-348
publisher: Springer
series: Lecture Notes in Computer Science
volume: 13936
crossref: conf/pakdd/2023-2
ee: https://doi.org/10.1007/978-3-031-33377-4_26
isbn: 978-3-031-33377-4
url: http://dblp.uni-trier.de/db/conf/pakdd/pakdd2023-2.html#NahaliAHPPS23

Tags

dblp

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

search on