Author of the publication

Input Generlization in Delayed Reinforcement Learning: An Algorithm and Performance Comparisons

, and . Proc.\ 12th International Joint Conf.\ on Artificial Intelligence (IJCAI-91), Sydney, Australia, (1991)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

FFRob: An Efficient Heuristic for Task and Motion Planning., , and . WAFR, volume 107 of Springer Tracts in Advanced Robotics, page 179-195. Springer, (2014)Input Generlization in Delayed Reinforcement Learning: An Algorithm and Performance Comparisons, and . Proc.\ 12th International Joint Conf.\ on Artificial Intelligence (IJCAI-91), Sydney, Australia, (1991)Heuristic search of multiagent influence space., , and . AAMAS, page 973-980. IFAAMAS, (2012)Learning Probabilistic Relational Dynamics for Multiple Tasks, , , and . CoRR, (2012)Intelligent Interaction with the Real World.. ECML/PKDD (1), volume 6321 of Lecture Notes in Computer Science, page 3. Springer, (2010)Partially Observable Markov Decision Processes for Artificial Intelligence., , and . KI, volume 981 of Lecture Notes in Computer Science, page 1-17. Springer, (1995)The National Science Foundation Workshop on Reinforcement Learning., and . AI Magazine, 17 (4): 89-93 (1996)Generalization in Deep Learning, , and . (2017)cite arxiv:1710.05468Comment: To appear in Mathematics of Deep Learning, Cambridge University Press. All previous results remain unchanged.Optimization in the now: Dynamic peephole optimization for hierarchical planning., , and . ICRA, page 4560-4567. IEEE, (2013)Hierarchical planning for multi-contact non-prehensile manipulation., , and . IROS, page 264-271. IEEE, (2015)