Article,

Posterior sampling for reinforcement learning: worst-case regret bounds.

, and .
CoRR, (2017)

Meta data

Tags

Users

  • @dblp

Comments and Reviews