Author of the publication

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Universal Option Models., , , , and . NIPS, page 990-998. (2014)Reinforcing Classical Planning for Adversary Driving Scenarios., , and . CoRR, (2019)The Sufficiency of Off-Policyness and Soft Clipping: PPO Is Still Insufficient according to an Off-Policy Measure., , , , , , , , , and . AAAI, page 7078-7086. AAAI Press, (2023)Pseudo-MDPs and factored linear action models., , , and . ADPRL, page 1-9. IEEE, (2014)Multi-Step Dyna Planning for Policy Evaluation and Control., , , , and . NIPS, page 2187-2195. Curran Associates, Inc., (2009)Breaking the Deadly Triad with a Target Network., , and . ICML, volume 139 of Proceedings of Machine Learning Research, page 12621-12631. PMLR, (2021)Provably Convergent Two-Timescale Off-Policy Actor-Critic with Function Approximation., , , and . ICML, volume 119 of Proceedings of Machine Learning Research, page 11204-11213. PMLR, (2020)Understanding and mitigating the limitations of prioritized experience replay., , , , , , and . UAI, volume 180 of Proceedings of Machine Learning Research, page 1561-1571. PMLR, (2022)QUOTA: The Quantile Option Architecture for Reinforcement Learning., and . AAAI, page 5797-5804. AAAI Press, (2019)Historical Temporal Difference Learning: Some Initial Results., , and . IMSCCS (2), page 678-685. IEEE Computer Society, (2006)0-7695-2581-4.