Inproceedings

Interactive Multi-objective Reinforcement Learning in Multi-armed Bandits with Gaussian Process Utility Models.

D. Roijers, L. Zintgraf, P. Libin, M. Reymond, E. Bargiacchi, and A. Nowé.
ECML/PKDD (3), volume 12459 of Lecture Notes in Computer Science, pages 463-478. Springer (2020)

Metadata

BibTeX key: conf/pkdd/RoijersZLRBN20
entry type: inproceedings
booktitle: ECML/PKDD (3)
year: 2020
pages: 463-478
publisher: Springer
series: Lecture Notes in Computer Science
volume: 12459
crossref: conf/pkdd/2020-3
ee: https://doi.org/10.1007/978-3-030-67664-3_28
isbn: 978-3-030-67664-3
url: http://dblp.uni-trier.de/db/conf/pkdd/pkdd2020-3.html#RoijersZLRBN20

Tags

dblp
