Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Input Generlization in Delayed Reinforcement Learning: An Algorithm and Performance Comparisons

D. Chapman, and L. Kaelbling. Proc.\ 12th International Joint Conf.\ on Artificial Intelligence (IJCAI-91), Sydney, Australia, (1991)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Leslie Melters

Leslie Schlüter

Leslie Knolle

Leslie Tramontini

Leslie Heckmann

Other publications of authors with the same name

FFRob: An Efficient Heuristic for Task and Motion Planning.C. Garrett, T. Lozano-Pérez, and L. Kaelbling. WAFR, volume 107 of Springer Tracts in Advanced Robotics, page 179-195. Springer, (2014)Input Generlization in Delayed Reinforcement Learning: An Algorithm and Performance ComparisonsD. Chapman, and L. Kaelbling. Proc.\ 12th International Joint Conf.\ on Artificial Intelligence (IJCAI-91), Sydney, Australia, (1991)Heuristic search of multiagent influence space.S. Witwicki, F. Oliehoek, and L. Kaelbling. AAMAS, page 973-980. IFAAMAS, (2012)Learning Probabilistic Relational Dynamics for Multiple TasksA. Deshpande, B. Milch, L. Zettlemoyer, and L. Kaelbling. CoRR, (2012)Intelligent Interaction with the Real World.L. Kaelbling. ECML/PKDD (1), volume 6321 of Lecture Notes in Computer Science, page 3. Springer, (2010)Partially Observable Markov Decision Processes for Artificial Intelligence.L. Kaelbling, M. Littman, and A. Cassandra. KI, volume 981 of Lecture Notes in Computer Science, page 1-17. Springer, (1995)The National Science Foundation Workshop on Reinforcement Learning.S. Mahadevan, and L. Kaelbling. AI Magazine, 17 (4): 89-93 (1996)Generalization in Deep LearningK. Kawaguchi, L. Kaelbling, and Y. Bengio. (2017)cite arxiv:1710.05468Comment: To appear in Mathematics of Deep Learning, Cambridge University Press. All previous results remain unchanged.Optimization in the now: Dynamic peephole optimization for hierarchical planning.D. Hadfield-Menell, L. Kaelbling, and T. Lozano-Pérez. ICRA, page 4560-4567. IEEE, (2013)Hierarchical planning for multi-contact non-prehensile manipulation.G. Lee, T. Lozano-Pérez, and L. Kaelbling. IROS, page 264-271. IEEE, (2015)

BibSonomy

Disambiguation of "Kaelbling, Leslie Pack"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Input Generlization in Delayed Reinforcement Learning: An Algorithm and Performance Comparisons

Please choose a person to relate this publication to

Leslie Melters

Leslie Schlüter

Leslie Knolle

Leslie Tramontini

Leslie Heckmann

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Kaelbling, Leslie Pack"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Input Generlization in Delayed Reinforcement Learning: An Algorithm and Performance Comparisons

Please choose a person to relate this publication to

Leslie Melters

Leslie Schlüter

Leslie Knolle

Leslie Tramontini

Leslie Heckmann

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Input Generlization in Delayed Reinforcement Learning: An Algorithm and Performance Comparisons