From post

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

Dissimilar Nodes Improve Graph Active Learning., , , , , и . CoRR, (2022)MENTOR: Guiding Hierarchical Reinforcement Learning with Human Feedback and Dynamic Distance Constraint., , , и . CoRR, (2024)Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback., , , , , , , , и . CoRR, (2024)MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL., , , , , , и . ICML, том 202 из Proceedings of Machine Learning Research, стр. 26087-26105. PMLR, (2023)AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model., , , , , , , , , и . CoRR, (2023)EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model., , , , , , , , и . CoRR, (2022)SheetAgent: A Generalist Agent for Spreadsheet Reasoning and Manipulation via Large Language Models., , , , , , и . CoRR, (2024)DiffuserLite: Towards Real-time Diffusion Planning., , , , , , и . CoRR, (2024)Explicit Dynamic Coordination Reinforcement Learning Based on Utility., , , , и . KSII Trans. Internet Inf. Syst., 16 (3): 792-812 (2022)Enhancing Robotic Manipulation with AI Feedback from Multimodal Large Language Models., , , , , , и . CoRR, (2024)