Author of the publication

Reinforcement Learning with a Disentangled Universal Value Function for Item Recommendation.

, , , , , , , and . AAAI, page 4427-4435. AAAI Press, (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Efficient policy detecting and reusing for non-stationarity in Markov games., , , , , , and . Auton. Agents Multi Agent Syst., 35 (1): 2 (2021)Value Function Transfer for Deep Multi-Agent Reinforcement Learning Based on N-Step Returns., , , , and . IJCAI, page 457-463. ijcai.org, (2019)Prioritized Trajectory Replay: A Replay Memory for Data-driven Reinforcement Learning., , , , , , and . CoRR, (2023)Semi-Supervised Learning for In-Game Expert-Level Music-to-Dance Translation., , , , , , , , and . CoRR, (2020)Learn to Effectively Explore in Context-Based Meta-RL., , , , , and . CoRR, (2020)Easy and Efficient Transformer : Scalable Inference Solution For large NLP mode., , , , , , , and . CoRR, (2021)Off-Beat Multi-Agent Reinforcement Learning., , , , , , , , , and . CoRR, (2022)Towards a Simultaneous and Granular Identity-Expression Control in Personalized Face Generation., , , , , , , and . CoRR, (2024)Write-a-speaker: Text-based Emotional and Rhythmic Talking-head Generation., , , , , , and . CoRR, (2021)Distributed Multi-Robot Obstacle Avoidance via Logarithmic Map-based Deep Reinforcement Learning., , , , , and . CoRR, (2022)