From post

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

AgentStudio: A Toolkit for Building General Virtual Agents., , , , , и . CoRR, (2024)Regret Minimization Experience Replay in Off-Policy Reinforcement Learning., , , , , и . NeurIPS, стр. 17604-17615. (2021)State Regularized Policy Optimization on Data with Dynamics Shift., , , , , , и . CoRR, (2023)Two-Stage Constrained Actor-Critic for Short Video Recommendation., , , , , , , , , и 2 other автор(ы). WWW, стр. 865-875. ACM, (2023)AdaRec: Adaptive Sequential Recommendation for Reinforcing Long-term User Engagement., , , , , , , и . CoRR, (2023)MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement Learning., , , , и . CoRR, (2021)Regret Minimization Experience Replay., , , , , и . CoRR, (2021)PrefRec: Preference-based Recommender Systems for Reinforcing Long-term User Engagement., , , , , , , и . CoRR, (2022)PrefRec: Recommender Systems with Human Preferences for Reinforcing Long-term User Engagement., , , , , , , , и . KDD, стр. 2874-2884. ACM, (2023)Guarded Policy Optimization with Imperfect Online Demonstrations., , , , и . ICLR, OpenReview.net, (2023)