Inproceedings,

Interactive Multi-objective Reinforcement Learning in Multi-armed Bandits with Gaussian Process Utility Models.

, , , , , and .
ECML/PKDD (3), volume 12459 of Lecture Notes in Computer Science, page 463-478. Springer, (2020)

Meta data

Tags

Users

  • @dblp

Comments and Reviews