Author of the publication

PrefRec: Recommender Systems with Human Preferences for Reinforcing Long-term User Engagement.

, , , , , , , , and . KDD, page 2874-2884. ACM, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Policy Optimization with Model-Based Explorations., , , , , , , and . AAAI, page 4675-4682. AAAI Press, (2019)Softmax Deep Double Deterministic Policy Gradients., , and . NeurIPS, (2020)Reinforcing User Retention in a Billion Scale Short Video Recommender System., , , , , , , , and . WWW (Companion Volume), page 421-426. ACM, (2023)Two-Stage Constrained Actor-Critic for Short Video Recommendation., , , , , , , , , and 2 other author(s). WWW, page 865-875. ACM, (2023)ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor., , , , , , and . ICLR, OpenReview.net, (2023)Sequential Recommendation for Optimizing Both Immediate Feedback and Long-term Retention., , , , , , , , and . SIGIR, page 1872-1882. ACM, (2024)AdaRec: Adaptive Sequential Recommendation for Reinforcing Long-term User Engagement., , , , , , , and . CoRR, (2023)PrefRec: Recommender Systems with Human Preferences for Reinforcing Long-term User Engagement., , , , , , , , and . KDD, page 2874-2884. ACM, (2023)Deterministic Value-Policy Gradients., , and . AAAI, page 3316-3323. AAAI Press, (2020)Multi-armed Bandit Mechanism with Private Histories., , and . AAMAS, page 1607-1609. ACM, (2017)