Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Garbage In, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits.

B. Kveton, C. Szepesvári, S. Vaswani, Z. Wen, T. Lattimore, and M. Ghavamzadeh. ICML, volume 97 of Proceedings of Machine Learning Research, page 3601-3610. PMLR, (2019)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Lavanya Sharan

Shambhu Sharan Sahay

Other publications of authors with the same name

Decision-Aware Actor-Critic with Function Approximation and Theoretical Guarantees.S. Vaswani, A. Kazemi, R. Babanezhad, and N. Roux. CoRR, (2023)Horde of Bandits using Gaussian Markov Random Fields.S. Vaswani, M. Schmidt, and L. Lakshmanan. AISTATS, volume 54 of Proceedings of Machine Learning Research, page 690-699. PMLR, (2017)Garbage In, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits.B. Kveton, C. Szepesvári, S. Vaswani, Z. Wen, T. Lattimore, and M. Ghavamzadeh. ICML, volume 97 of Proceedings of Machine Learning Research, page 3601-3610. PMLR, (2019)Near-Optimal Sample Complexity Bounds for Constrained MDPs.S. Vaswani, L. Yang, and C. Szepesvári. NeurIPS, (2022)Towards Noise-adaptive, Problem-adaptive (Accelerated) Stochastic Gradient Descent.S. Vaswani, B. Dubois-Taine, and R. Babanezhad. ICML, volume 162 of Proceedings of Machine Learning Research, page 22015-22059. PMLR, (2022)A general class of surrogate functions for stable and efficient reinforcement learning.S. Vaswani, O. Bachem, S. Totaro, R. Müller, S. Garg, M. Geist, M. Machado, P. Castro, and N. Roux. AISTATS, volume 151 of Proceedings of Machine Learning Research, page 8619-8649. PMLR, (2022)Combining Bayesian Optimization and Lipschitz Optimization.M. Ahmed, S. Vaswani, and M. Schmidt. CoRR, (2018)Target-based Surrogates for Stochastic Optimization.J. Lavington, S. Vaswani, R. Harikandeh, M. Schmidt, and N. Roux. ICML, volume 202 of Proceedings of Machine Learning Research, page 18614-18651. PMLR, (2023)Old Dog Learns New Tricks: Randomized UCB for Bandit Problems.S. Vaswani, A. Mehrabian, A. Durand, and B. Kveton. AISTATS, volume 108 of Proceedings of Machine Learning Research, page 1988-1998. PMLR, (2020)Fast and Faster Convergence of SGD for Over-Parameterized Models and an Accelerated Perceptron.S. Vaswani, F. Bach, and M. Schmidt. CoRR, (2018)

BibSonomy

Disambiguation of "Vaswani, Sharan"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Garbage In, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits.

Please choose a person to relate this publication to

Lavanya Sharan

Shambhu Sharan Sahay

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Vaswani, Sharan"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Garbage In, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits.

Please choose a person to relate this publication to

Lavanya Sharan

Shambhu Sharan Sahay

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Garbage In, Reward Out: Bootstrapping Exploration in Multi-Armed Bandits.