Article,

Posterior sampling for reinforcement learning: worst-case regret bounds.

S. Agrawal, and R. Jia.
CoRR, (2017)

Meta data

BibTeX key: journals/corr/AgrawalJ17
entry type: article
year: 2017
journal: CoRR
volume: abs/1705.07041
ee: http://arxiv.org/abs/1705.07041
url: http://dblp.uni-trier.de/db/journals/corr/corr1705.html#AgrawalJ17

Tags

dblp

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

search on