From post

копировать удалить добавить публикацию в буфер
Запись сообщества
посмотреть историю данной записи
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Uncertainty-Aware Low-Rank Q-Matrix Estimation for Deep Reinforcement Learning.

T. Sang, H. Tang, J. Hao, Y. Zheng, и Z. Meng. DAI, том 13170 из Lecture Notes in Computer Science, стр. 21-37. Springer, (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

Jianye Hao

Jianye Yang

Wenjin Hao

Xiaofei Hao

Jin Hao

Другие публикации лиц с тем же именем

Learning When to Transfer among Agents: An Efficient Multiagent Transfer Learning Framework.T. Yang, W. Wang, H. Tang, J. Hao, Z. Meng, W. Liu, Y. Hu, и Y. Chen. CoRR, (2020)The Dynamics of Reinforcement Social Learning in Cooperative Multiagent Systems.J. Hao, и H. fung Leung. IJCAI, стр. 184-190. IJCAI/AAAI, (2013)Plan Your Target and Learn Your Skills: Transferable State-Only Imitation Learning via Decoupled Policy Optimization.M. Liu, Z. Zhu, Y. Zhuang, W. Zhang, J. Hao, Y. Yu, и J. Wang. ICML, том 162 из Proceedings of Machine Learning Research, стр. 14173-14196. PMLR, (2022)Online Ad Hoc Teamwork under Partial Observability.P. Gu, M. Zhao, J. Hao, и B. An. ICLR, OpenReview.net, (2022)Probabilistic Model Checking Multi-agent Behaviors in Dispersion Games Using Counter Abstraction.J. Hao, S. Song, Y. Liu, J. Sun, L. Gui, J. Dong, и H. fung Leung. PRIMA, том 7455 из Lecture Notes in Computer Science, стр. 16-30. Springer, (2012)Off-Policy Training for Truncated TD(λ) Boosted Soft Actor-Critic.S. Huang, B. Wang, H. Su, D. Li, J. Hao, J. Zhu, и T. Chen. PRICAI (3), том 13033 из Lecture Notes in Computer Science, стр. 46-59. Springer, (2021)Towards Effective Context for Meta-Reinforcement Learning: an Approach based on Contrastive Learning.H. Fu, H. Tang, J. Hao, C. Chen, X. Feng, D. Li, и W. Liu. AAAI, стр. 7457-7465. AAAI Press, (2021)Foresee then Evaluate: Decomposing Value Estimation with Latent Future Prediction.H. Tang, Z. Meng, G. Chen, P. Chen, C. Chen, Y. Yang, L. Zhang, W. Liu, и J. Hao. AAAI, стр. 9834-9842. AAAI Press, (2021)Structure Aware Incremental Learning with Personalized Imitation Weights for Recommender Systems.Y. Wang, Y. Zhang, A. Valkanas, R. Tang, C. Ma, J. Hao, и M. Coates. AAAI, стр. 4711-4719. AAAI Press, (2023)A Unified Framework for Layout Pattern Analysis with Deep Causal Estimation.R. Chen, S. Hu, Z. Chen, S. Zhu, B. Yu, P. Li, C. Chen, Y. Huang, и J. Hao. ICCAD, стр. 1-9. IEEE, (2021)

BibSonomy

Disambiguation