Author of the publication

Exploration Conscious Reinforcement Learning Revisited.

, , and . ICML, volume 97 of Proceedings of Machine Learning Research, page 5680-5689. PMLR, (2019)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Bandits with partially observable confounded data., , , and . UAI, volume 161 of Proceedings of Machine Learning Research, page 430-439. AUAI Press, (2021)Principled Offline RL in the Presence of Rich Exogenous Information., , , , , , , , , and 1 other author(s). ICML, volume 202 of Proceedings of Machine Learning Research, page 14390-14421. PMLR, (2023)Provable Reinforcement Learning with a Short-Term Memory., , , and . ICML, volume 162 of Proceedings of Machine Learning Research, page 5832-5850. PMLR, (2022)Guaranteed Discovery of Control-Endogenous Latent States with Multi-Step Inverse Models., , , , , , , , , and . Trans. Mach. Learn. Res., (2023)Tight Regret Bounds for Model-Based Reinforcement Learning with Greedy Policies., , , and . NeurIPS, page 12203-12213. (2019)Reinforcement Learning in Reward-Mixing MDPs., , , and . NeurIPS, page 2253-2264. (2021)Confidence-Budget Matching for Sequential Budgeted Learning., , , and . ICML, volume 139 of Proceedings of Machine Learning Research, page 2937-2947. PMLR, (2021)Exploration Conscious Reinforcement Learning Revisited., , and . ICML, volume 97 of Proceedings of Machine Learning Research, page 5680-5689. PMLR, (2019)Reinforcement Learning with Trajectory Feedback., , and . AAAI, page 7288-7295. AAAI Press, (2021)Tractable Optimality in Episodic Latent MABs., , , and . NeurIPS, (2022)