EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL.

S. Ghasemipour, D. Schuurmans, and S. Gu. ICML, volume 139 of Proceedings of Machine Learning Research, pp. 3682-3691. PMLR, (2021)

Other publications by persons with the same name

Braxlines: Fast and Interactive Toolkit for RL-driven Behavior Engineering beyond Reward Maximization. S. Gu, M. Diaz, C. Freeman, H. Furuta, S. Ghasemipour, A. Raichuk, B. David, E. Frey, E. Coumans, and O. Bachem. CoRR, (2021)

Understanding the Relation Between Maximum-Entropy Inverse Reinforcement Learning and Behaviour Cloning. S. Ghasemipour, S. Gu, and R. Zemel. DGS@ICLR, OpenReview.net, (2019)

Gradient-based Optimization of Neural Network Architecture. W. Grathwohl, E. Creager, S. Ghasemipour, and R. Zemel. ICLR (Workshop), OpenReview.net, (2018)

EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL. S. Ghasemipour, D. Schuurmans, and S. Gu. ICML, volume 139 of Proceedings of Machine Learning Research, pp. 3682-3691. PMLR, (2021)

SMILe: Scalable Meta Inverse Reinforcement Learning through Context-Conditional Policies. S. Ghasemipour, S. Gu, and R. Zemel. NeurIPS, pp. 7879-7889. (2019)

EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL. S. Ghasemipour, D. Schuurmans, and S. Gu. CoRR, (2020)

Bi-Manual Block Assembly via Sim-to-Real Reinforcement Learning. S. Kataoka, Y. Chung, S. Ghasemipour, P. Sanketi, S. Gu, and I. Mordatch. CoRR, (2023)

Bi-Manual Manipulation and Attachment via Sim-to-Real Reinforcement Learning. S. Kataoka, S. Ghasemipour, C. Freeman, and I. Mordatch. CoRR, (2022)

Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding. C. Saharia, W. Chan, S. Saxena, L. Li, J. Whang, E. Denton, S. Ghasemipour, R. Lopes, B. Ayan, T. Salimans, and 3 other authors. NeurIPS, (2022)

A Divergence Minimization Perspective on Imitation Learning Methods. S. Ghasemipour, R. Zemel, and S. Gu. CoRL, volume 100 of Proceedings of Machine Learning Research, pp. 1259-1277. PMLR, (2019)
