Author of the publication

PrefRec: Recommender Systems with Human Preferences for Reinforcing Long-term User Engagement.

W. Xue, Q. Cai, Z. Xue, S. Sun, S. Liu, D. Zheng, P. Jiang, K. Gai, and B. An. KDD, page 2874-2884. ACM, (2023)

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to a name to display some publications already assigned to that person.

Jian Cai

Yongjie Cai

Lijun Cai

Cai Berg

Xuediao Cai

Other publications of authors with the same name

Policy Optimization with Model-Based Explorations. F. Pan, Q. Cai, A. Zeng, C. Pan, Q. Da, H. He, Q. He, and P. Tang. AAAI, page 4675-4682. AAAI Press, (2019)

Softmax Deep Double Deterministic Policy Gradients. L. Pan, Q. Cai, and L. Huang. NeurIPS, (2020)

Reinforcing User Retention in a Billion Scale Short Video Recommender System. Q. Cai, S. Liu, X. Wang, T. Zuo, W. Xie, B. Yang, D. Zheng, P. Jiang, and K. Gai. WWW (Companion Volume), page 421-426. ACM, (2023)

Two-Stage Constrained Actor-Critic for Short Video Recommendation. Q. Cai, Z. Xue, C. Zhang, W. Xue, S. Liu, R. Zhan, X. Wang, T. Zuo, W. Xie, D. Zheng and 2 other author(s). WWW, page 865-875. ACM, (2023)

ResAct: Reinforcing Long-term Engagement in Sequential Recommendation with Residual Actor. W. Xue, Q. Cai, R. Zhan, D. Zheng, P. Jiang, K. Gai, and B. An. ICLR, OpenReview.net, (2023)

Sequential Recommendation for Optimizing Both Immediate Feedback and Long-term Retention. Z. Liu, S. Liu, Z. Zhang, Q. Cai, X. Zhao, K. Zhao, L. Hu, P. Jiang, and K. Gai. SIGIR, page 1872-1882. ACM, (2024)

AdaRec: Adaptive Sequential Recommendation for Reinforcing Long-term User Engagement. Z. Xue, Q. Cai, T. Zuo, B. Yang, L. Hu, P. Jiang, K. Gai, and B. An. CoRR, (2023)

PrefRec: Recommender Systems with Human Preferences for Reinforcing Long-term User Engagement. W. Xue, Q. Cai, Z. Xue, S. Sun, S. Liu, D. Zheng, P. Jiang, K. Gai, and B. An. KDD, page 2874-2884. ACM, (2023)

Deterministic Value-Policy Gradients. Q. Cai, L. Pan, and P. Tang. AAAI, page 3316-3323. AAAI Press, (2020)

Multi-armed Bandit Mechanism with Private Histories. C. Liu, Q. Cai, and Y. Zhang. AAMAS, page 1607-1609. ACM, (2017)

BibSonomy

Disambiguation of "Cai, Qingpeng"
