From post

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

TRAD: Enhancing LLM Agents with Step-Wise Thought Retrieval and Aligned Decision., , , , , , , , и . CoRR, (2024)Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning., , , , , , и . CoRR, (2021)MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning., , , , , , , , , и . J. Mach. Learn. Res., (2023)Offline Pre-trained Multi-agent Decision Transformer., , , , , , , , , и 1 other автор(ы). Mach. Intell. Res., 20 (2): 233-248 (апреля 2023)Large sequence models for sequential decision-making: a survey., , , , , , , , и . Frontiers Comput. Sci., 17 (6): 176349 (декабря 2023)Offline Pre-trained Multi-Agent Decision Transformer: One Big Sequence Model Tackles All SMAC Tasks., , , , , , , , , и . CoRR, (2021)MALib: A Parallel Framework for Population-based Multi-agent Reinforcement Learning., , , , , , , , и . CoRR, (2021)Multi-Agent Reinforcement Learning is a Sequence Modeling Problem., , , , , , и . CoRR, (2022)Entropy-Regularized Token-Level Policy Optimization for Large Language Models., , , , и . CoRR, (2024)Alphazero-like Tree-Search can Guide Large Language Model Decoding and Training., , , , , и . CoRR, (2023)