Combining Pessimism with Optimism for Robust and Efficient Model-Based Deep Reinforcement Learning.

ICML, volume 139 of Proceedings of Machine Learning Research, pp. 2254-2264. PMLR, (2021)

Other publications by persons with the same name

Learning to Compensate Photovoltaic Power Fluctuations from Images of the Sky by Imitating an Optimal Policy. CoRR, (2018)

Streaming submodular maximization: massive data summarization on the fly. KDD, pp. 671-680. ACM, (2014)

Intelligent light control using sensor networks. SenSys, pp. 218-229. ACM, (2005)

Corruption-Tolerant Gaussian Process Bandit Optimization. AISTATS, volume 108 of Proceedings of Machine Learning Research, pp. 1071-1081. PMLR, (2020)

Mixed-Variable Bayesian Optimization. IJCAI, pp. 2633-2639. ijcai.org, (2020). Scheduled for July 2020, Yokohama, Japan, postponed due to the Corona pandemic.

Safe non-smooth black-box optimization with application to policy search. L4DC, volume 120 of Proceedings of Machine Learning Research, pp. 980-989. PMLR, (2020)

Safe Convex Learning under Uncertain Constraints. AISTATS, volume 89 of Proceedings of Machine Learning Research, pp. 2106-2114. PMLR, (2019)

A Comparison of Market Structures with Near-Zero-Intelligence Traders. IDEAL, volume 5788 of Lecture Notes in Computer Science, pp. 703-710. Springer, (2009)

Safe Risk-Averse Bayesian Optimization for Controller Tuning. IEEE Robotics Autom. Lett., 8 (12): 8208-8215 (December 2023)

Data-Efficient Task Generalization via Probabilistic Model-based Meta Reinforcement Learning. CoRR, (2023)