From post

Large-scale Interactive Conversational Recommendation System using Actor-Critic Framework

, , и . Fifteenth ACM Conference on Recommender Systems, стр. 220-229. ACM, (сентября 2021)
DOI: 10.1145/3460231.3474271

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

A Notation for Markov Decision Processes.. CoRR, (2015)Reinforcement Learning When All Actions Are Not Always Available., , , и . AAAI, стр. 3381-3388. AAAI Press, (2020)Increasing the Action Gap: New Operators for Reinforcement Learning., , , , и . AAAI, стр. 1476-1483. AAAI Press, (2016)SOPE: Spectrum of Off-Policy Estimators., , , , и . NeurIPS, стр. 18958-18969. (2021)Learning Fair Representations with High-Confidence Guarantees., , и . CoRR, (2023)Optimization using Parallel Gradient Evaluations on Multiple Parameters., , , , и . CoRR, (2023)Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments., , , и . AISTATS, том 206 из Proceedings of Machine Learning Research, стр. 5474-5492. PMLR, (2023)Evaluating the Performance of Reinforcement Learning Algorithms., , , , и . ICML, том 119 из Proceedings of Machine Learning Research, стр. 4962-4973. PMLR, (2020)Optimizing for the Future in Non-Stationary MDPs., , , , , и . ICML, том 119 из Proceedings of Machine Learning Research, стр. 1414-1425. PMLR, (2020)Is the Policy Gradient a Gradient?, и . AAMAS, стр. 939-947. International Foundation for Autonomous Agents and Multiagent Systems, (2020)