Author of the publication

Optimistic Policy Optimization via Multiple Importance Sampling.

ICML, volume 97 of Proceedings of Machine Learning Research, pages 4989-4999. PMLR, (2019)


Other publications of authors with the same name

Gradient-Aware Model-based Policy Search. CoRR, (2019)
Importance Sampling Techniques for Policy Optimization. J. Mach. Learn. Res., (2020)
Autoregressive Bandits. CoRR, (2022)
ARLO: A Framework for Automated Reinforcement Learning. CoRR, (2022)
Towards Theoretical Understanding of Inverse Reinforcement Learning. ICML, volume 202 of Proceedings of Machine Learning Research, pages 24555-24591. PMLR, (2023)
On the Relation between Policy Improvement and Off-Policy Minimum-Variance Policy Evaluation. UAI, volume 216 of Proceedings of Machine Learning Research, pages 1423-1433. PMLR, (2023)
Compatible Reward Inverse Reinforcement Learning. NIPS, pages 2050-2059. (2017)
Trust Region Meta Learning for Policy Optimization. Meta-Knowledge Transfer @ ECML/PKDD, volume 191 of Proceedings of Machine Learning Research, pages 62-74. PMLR, (2022)
Pure Exploration under Mediators' Feedback. CoRR, (2023)
Optimizing Empty Container Repositioning and Fleet Deployment via Configurable Semi-POMDPs. CoRR, (2022)