Author of the publication

Acting Optimally in Partially Observable Stochastic Domains.

AAAI, pages 1023-1028. AAAI Press / The MIT Press, (1994). ISBN 0-262-61102-3.

Other publications of authors with the same name

Input Generalization in Delayed Reinforcement Learning: An Algorithm and Performance Comparisons. Proc. 12th International Joint Conf. on Artificial Intelligence (IJCAI-91), Sydney, Australia, (1991)
FFRob: An Efficient Heuristic for Task and Motion Planning. WAFR, volume 107 of Springer Tracts in Advanced Robotics, pages 179-195. Springer, (2014)
Learning Probabilistic Relational Dynamics for Multiple Tasks. CoRR, (2012)
Generalization in Deep Learning. (2017). arXiv:1710.05468. Comment: To appear in Mathematics of Deep Learning, Cambridge University Press. All previous results remain unchanged.
Object-Based World Modeling in Semi-Static Environments with Dependent Dirichlet Process Mixtures. IJCAI, pages 3513-3521. IJCAI/AAAI Press, (2016)
Optimization in the now: Dynamic peephole optimization for hierarchical planning. ICRA, pages 4560-4567. IEEE, (2013)
On reinforcement learning for robots. IROS, pages 1319-1320. IEEE, (1996)
Hierarchical planning for multi-contact non-prehensile manipulation. IROS, pages 264-271. IEEE, (2015)
GLIB: Efficient Exploration for Relational Model-Based Reinforcement Learning via Goal-Literal Babbling. AAAI, pages 11782-11791. AAAI Press, (2021)
Solving Very Large Weakly Coupled Markov Decision Processes. AAAI/IAAI, pages 165-172. AAAI Press / The MIT Press, (1998)