Author of the publication

Garbage In, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits.

ICML, volume 97 of Proceedings of Machine Learning Research, pages 3601-3610. PMLR, (2019)

Other publications of authors with the same name

Pseudo-MDPs and factored linear action models. ADPRL, pages 1-9. IEEE, (2014)
Statistical linear estimation with penalized estimators: an application to reinforcement learning. ICML, icml.cc / Omnipress, (2012)
Learning to segment from a few well-selected training images. ICML, volume 382 of ACM International Conference Proceeding Series, pages 305-312. ACM, (2009)
Alignment Based Kernel Learning with a Continuous Set of Base Kernels. CoRR, (2011)
Manifold-adaptive dimension estimation. ICML, volume 227 of ACM International Conference Proceeding Series, pages 265-272. ACM, (2007)
Speeding Up Planning in Markov Decision Processes via Automatically Constructed Abstractions. CoRR, (2012)
PAC-Bayesian Policy Evaluation for Reinforcement Learning. CoRR, (2012)
On Minimax Optimal Offline Policy Evaluation. CoRR, (2014)
Unsupervised Sequential Sensor Acquisition. AISTATS, volume 54 of Proceedings of Machine Learning Research, pages 803-811. PMLR, (2017)
A Finite-Sample Generalization Bound for Semiparametric Regression: Partially Linear Models. AISTATS, volume 33 of JMLR Workshop and Conference Proceedings, pages 402-410. JMLR.org, (2014)