From post

EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL.

, , и . ICML, том 139 из Proceedings of Machine Learning Research, стр. 3682-3691. PMLR, (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

Braxlines: Fast and Interactive Toolkit for RL-driven Behavior Engineering beyond Reward Maximization., , , , , , , , , и . CoRR, (2021)Understanding the Relation Between Maximum-Entropy Inverse Reinforcement Learning and Behaviour Cloning., , и . DGS@ICLR, OpenReview.net, (2019)Gradient-based Optimization of Neural Network Architecture., , , и . ICLR (Workshop), OpenReview.net, (2018)EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL., , и . ICML, том 139 из Proceedings of Machine Learning Research, стр. 3682-3691. PMLR, (2021)SMILe: Scalable Meta Inverse Reinforcement Learning through Context-Conditional Policies., , и . NeurIPS, стр. 7879-7889. (2019)EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL., , и . CoRR, (2020)Bi-Manual Block Assembly via Sim-to-Real Reinforcement Learning., , , , , и . CoRR, (2023)Bi-Manual Manipulation and Attachment via Sim-to-Real Reinforcement Learning., , , и . CoRR, (2022)Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding., , , , , , , , , и 3 other автор(ы). NeurIPS, (2022)A Divergence Minimization Perspective on Imitation Learning Methods., , и . CoRL, том 100 из Proceedings of Machine Learning Research, стр. 1259-1277. PMLR, (2019)