Inproceedings,

Constructing Stochastic Mixture Policies for Episodic Multiobjective Reinforcement Learning Tasks.

P. Vamplew, R. Dazeley, E. Barker, and A. Kelarev.
Australasian Conference on Artificial Intelligence, volume 5866 of Lecture Notes in Computer Science, pages 340-349. Springer, 2009.

Metadata

BibTeX key: conf/ausai/VamplewDBK09
entry type: inproceedings
booktitle: Australasian Conference on Artificial Intelligence
year: 2009
pages: 340-349
publisher: Springer
series: Lecture Notes in Computer Science
volume: 5866
crossref: conf/ausai/2009
ee: https://doi.org/10.1007/978-3-642-10439-8_35
isbn: 978-3-642-10438-1
url: http://dblp.uni-trier.de/db/conf/ausai/ausai2009.html#VamplewDBK09

Tags

dblp
