From post

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

Near-optimal reinforcement learning in polynomial time, и . Proc. 15th International Conf. on Machine Learning, стр. 260--268. Morgan Kaufmann, San Francisco, CA, (1998)Between MDPs and Semi-MDPs: learning, planning, and representing knowledge at multiple temporal scales, , и . 98-74. University of Massachusetts, Amherst, MA 01003, (1998)Semantics and algorithms for trustworthy commitment achievement under model uncertainty., , и . Auton. Agents Multi Agent Syst., 34 (1): 19 (2020)Bootstrapped Meta-Learning., , , , , и . CoRR, (2021)Object-oriented state editing for HRL., , , , , , и . CoRR, (2019)Blockchain and Deep Learning for Secure Communication in Digital Twin Empowered Industrial IoT Network., , , , , и . IEEE Trans. Netw. Sci. Eng., 10 (5): 2802-2813 (сентября 2023)Discovering Diverse Nearly Optimal Policies withSuccessor Features., , , , , и . CoRR, (2021)Reinforcement Learning for Sparse-Reward Object-Interaction Tasks in First-person Simulated 3D Environments., , , , , , и . CoRR, (2020)In-context Reinforcement Learning with Algorithm Distillation., , , , , , , , , и 4 other автор(ы). CoRR, (2022)Disentangled Cumulants Help Successor Representations Transfer to New Tasks., , , , , , , и . CoRR, (2019)