Uncertainty-Aware Low-Rank Q-Matrix Estimation for Deep Reinforcement Learning.

DAI, volume 13170 of Lecture Notes in Computer Science, pp. 21-37. Springer, (2021)

Other publications by persons with the same name

Learning When to Transfer among Agents: An Efficient Multiagent Transfer Learning Framework. CoRR, (2020)

The Dynamics of Reinforcement Social Learning in Cooperative Multiagent Systems. IJCAI, pp. 184-190. IJCAI/AAAI, (2013)

Plan Your Target and Learn Your Skills: Transferable State-Only Imitation Learning via Decoupled Policy Optimization. ICML, volume 162 of Proceedings of Machine Learning Research, pp. 14173-14196. PMLR, (2022)

Online Ad Hoc Teamwork under Partial Observability. ICLR, OpenReview.net, (2022)

Probabilistic Model Checking Multi-agent Behaviors in Dispersion Games Using Counter Abstraction. PRIMA, volume 7455 of Lecture Notes in Computer Science, pp. 16-30. Springer, (2012)

Off-Policy Training for Truncated TD(λ) Boosted Soft Actor-Critic. PRICAI (3), volume 13033 of Lecture Notes in Computer Science, pp. 46-59. Springer, (2021)

Towards Effective Context for Meta-Reinforcement Learning: an Approach based on Contrastive Learning. AAAI, pp. 7457-7465. AAAI Press, (2021)

Foresee then Evaluate: Decomposing Value Estimation with Latent Future Prediction. AAAI, pp. 9834-9842. AAAI Press, (2021)

Structure Aware Incremental Learning with Personalized Imitation Weights for Recommender Systems. AAAI, pp. 4711-4719. AAAI Press, (2023)

A Unified Framework for Layout Pattern Analysis with Deep Causal Estimation. ICCAD, pp. 1-9. IEEE, (2021)