From post

копировать удалить добавить публикацию в буфер
Запись сообщества
посмотреть историю данной записи
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Large-scale Interactive Conversational Recommendation System using Actor-Critic Framework

A. Montazeralghaem, J. Allan, и P. Thomas. Fifteenth ACM Conference on Recommender Systems, стр. 220-229. ACM, (сентября 2021)
DOI: 10.1145/3460231.3474271

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

Thomas S Rau

Thomas S Eberle

Thomas S Mir

Thomas S Hoffmeister

Thomas S Gerstner

Другие публикации лиц с тем же именем

A Notation for Markov Decision Processes.P. Thomas. CoRR, (2015)Reinforcement Learning When All Actions Are Not Always Available.Y. Chandak, G. Theocharous, B. Metevier, и P. Thomas. AAAI, стр. 3381-3388. AAAI Press, (2020)Increasing the Action Gap: New Operators for Reinforcement Learning.M. Bellemare, G. Ostrovski, A. Guez, P. Thomas, и R. Munos. AAAI, стр. 1476-1483. AAAI Press, (2016)SOPE: Spectrum of Off-Policy Estimators.C. Yuan, Y. Chandak, S. Giguere, P. Thomas, и S. Niekum. NeurIPS, стр. 18958-18969. (2021)Learning Fair Representations with High-Confidence Guarantees.Y. Luo, A. Hoag, и P. Thomas. CoRR, (2023)Optimization using Parallel Gradient Evaluations on Multiple Parameters.Y. Chandak, S. Shankar, V. Gandikota, P. Thomas, и A. Mazumdar. CoRR, (2023)Asymptotically Unbiased Off-Policy Policy Evaluation when Reusing Old Data in Nonstationary Environments.V. Liu, Y. Chandak, P. Thomas, и M. White. AISTATS, том 206 из Proceedings of Machine Learning Research, стр. 5474-5492. PMLR, (2023)Evaluating the Performance of Reinforcement Learning Algorithms.S. Jordan, Y. Chandak, D. Cohen, M. Zhang, и P. Thomas. ICML, том 119 из Proceedings of Machine Learning Research, стр. 4962-4973. PMLR, (2020)Optimizing for the Future in Non-Stationary MDPs.Y. Chandak, G. Theocharous, S. Shankar, M. White, S. Mahadevan, и P. Thomas. ICML, том 119 из Proceedings of Machine Learning Research, стр. 1414-1425. PMLR, (2020)Is the Policy Gradient a Gradient?C. Nota, и P. Thomas. AAMAS, стр. 939-947. International Foundation for Autonomous Agents and Multiagent Systems, (2020)

BibSonomy

Disambiguation