Author of the publication

Garbage In, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits.

ICML, volume 97 of Proceedings of Machine Learning Research, pages 3601-3610. PMLR, (2019)

Other publications of authors with the same name

Decision-Aware Actor-Critic with Function Approximation and Theoretical Guarantees. CoRR, (2023)

Horde of Bandits using Gaussian Markov Random Fields. AISTATS, volume 54 of Proceedings of Machine Learning Research, pages 690-699. PMLR, (2017)

Garbage In, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits. ICML, volume 97 of Proceedings of Machine Learning Research, pages 3601-3610. PMLR, (2019)

Near-Optimal Sample Complexity Bounds for Constrained MDPs. NeurIPS, (2022)

Towards Noise-adaptive, Problem-adaptive (Accelerated) Stochastic Gradient Descent. ICML, volume 162 of Proceedings of Machine Learning Research, pages 22015-22059. PMLR, (2022)

A general class of surrogate functions for stable and efficient reinforcement learning. AISTATS, volume 151 of Proceedings of Machine Learning Research, pages 8619-8649. PMLR, (2022)

Combining Bayesian Optimization and Lipschitz Optimization. CoRR, (2018)

Target-based Surrogates for Stochastic Optimization. ICML, volume 202 of Proceedings of Machine Learning Research, pages 18614-18651. PMLR, (2023)

Old Dog Learns New Tricks: Randomized UCB for Bandit Problems. AISTATS, volume 108 of Proceedings of Machine Learning Research, pages 1988-1998. PMLR, (2020)

Fast and Faster Convergence of SGD for Over-Parameterized Models and an Accelerated Perceptron. CoRR, (2018)