Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed.

Other publications of persons with the same name

Distributional Reinforcement Learning with Quantile Regression. CoRR (2017)
Adaptive Trade-Offs in Off-Policy Learning. AISTATS, volume 108 of Proceedings of Machine Learning Research, pp. 34-44. PMLR (2020)
Conditional Importance Sampling for Off-Policy Learning. AISTATS, volume 108 of Proceedings of Machine Learning Research, pp. 45-55. PMLR (2020)
Human Alignment of Large Language Models through Online Preference Optimisation. CoRR (2024)
Nash Learning from Human Feedback. CoRR (2023)
Meta-learning of Sequential Strategies. CoRR (2019)
MICo: Learning improved representations via sampling-based state similarity for Markov decision processes. CoRR (2021)
Quantile Credit Assignment. ICML, volume 202 of Proceedings of Machine Learning Research, pp. 24517-24531. PMLR (2023)
Learning Dynamics and Generalization in Deep Reinforcement Learning. ICML, volume 162 of Proceedings of Machine Learning Research, pp. 14560-14581. PMLR (2022)
Taylor Expansion of Discount Factors. ICML, volume 139 of Proceedings of Machine Learning Research, pp. 10130-10140. PMLR (2021)