Author of the publication

Optimistic PAC Reinforcement Learning: the Instance-Dependent View.

, , and . ALT, volume 201 of Proceedings of Machine Learning Research, page 1460-1480. PMLR, (2023)

Other publications of authors with the same name

A Practical Algorithm for Multiplayer Bandits when Arm Means Vary Among Players., , , and . AISTATS, volume 108 of Proceedings of Machine Learning Research, page 1211-1221. PMLR, (2020)

On Multi-Armed Bandit Designs for Dose-Finding Trials., , and . J. Mach. Learn. Res., (2021)

Towards Instance-Optimality in Online PAC Reinforcement Learning., , and . CoRR, (2023)

Dealing with Unknown Variances in Best-Arm Identification., , and . ALT, volume 201 of Proceedings of Machine Learning Research, page 776-849. PMLR, (2023)

Episodic Reinforcement Learning in Finite MDPs: Minimax Lower Bounds Revisited., , , and . ALT, volume 132 of Proceedings of Machine Learning Research, page 578-598. PMLR, (2021)

Kernel-Based Reinforcement Learning: A Finite-Time Analysis., , , , and . ICML, volume 139 of Proceedings of Machine Learning Research, page 2783-2792. PMLR, (2021)

Active Coverage for PAC Reinforcement Learning., , and . COLT, volume 195 of Proceedings of Machine Learning Research, page 5044-5109. PMLR, (2023)

A Kernel-Based Approach to Non-Stationary Reinforcement Learning in Metric Spaces., , , , and . AISTATS, volume 130 of Proceedings of Machine Learning Research, page 3538-3546. PMLR, (2021)

Mixture Martingales Revisited with Applications to Sequential Tests and Confidence Intervals., and . CoRR, (2018)

New Algorithms for Multiplayer Bandits when Arm Means Vary Among Players., and . CoRR, (2019)