Author of the publication

Conditional Importance Sampling for Off-Policy Learning.

, , , , , , and . AISTATS, volume 108 of Proceedings of Machine Learning Research, page 45-55. PMLR, (2020)


Other publications of authors with the same name

What Can Learned Intrinsic Rewards Capture?, , , , , , , and . CoRR, (2019)

Meta-learning of Sequential Strategies., , , , , , , , , and 14 other author(s). CoRR, (2019)

Unicorn: Continual Learning with a Universal, Off-policy Agent., , , , , , , , , and . CoRR, (2018)

Conditional Importance Sampling for Off-Policy Learning., , , , , , and . AISTATS, volume 108 of Proceedings of Machine Learning Research, page 45-55. PMLR, (2020)

Behaviour Suite for Reinforcement Learning., , , , , , , , , and 4 other author(s). ICLR, (2020)

Multi-Task Deep Reinforcement Learning with PopArt., , , , , and . AAAI, page 3796-3803. AAAI Press, (2019)

Weighted importance sampling for off-policy learning with linear function approximation., , and . NIPS, page 3014-3022. (2014)

What Can Learned Intrinsic Rewards Capture?, , , , , , , and . ICML, volume 119 of Proceedings of Machine Learning Research, page 11436-11446. PMLR, (2020)

Behaviour Suite for Reinforcement Learning., , , , , , , , , and 4 other author(s). ICLR, OpenReview.net, (2020)

Estimating the Maximum Expected Value: An Analysis of (Nested) Cross Validation and the Maximum Sample Average. CoRR, (2013)