Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Acting Optimally in Partially Observable Stochastic Domains.

A. Cassandra, L. Kaelbling, and M. Littman. AAAI, page 1023-1028. AAAI Press / The MIT Press, (1994)0-262-61102-3.

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Leslie Melters

Leslie Schlüter

Leslie Knolle

Leslie Tramontini

Leslie Heckmann

Other publications of authors with the same name

Input Generlization in Delayed Reinforcement Learning: An Algorithm and Performance ComparisonsD. Chapman, and L. Kaelbling. Proc.\ 12th International Joint Conf.\ on Artificial Intelligence (IJCAI-91), Sydney, Australia, (1991)FFRob: An Efficient Heuristic for Task and Motion Planning.C. Garrett, T. Lozano-Pérez, and L. Kaelbling. WAFR, volume 107 of Springer Tracts in Advanced Robotics, page 179-195. Springer, (2014)Learning Probabilistic Relational Dynamics for Multiple TasksA. Deshpande, B. Milch, L. Zettlemoyer, and L. Kaelbling. CoRR, (2012)Generalization in Deep LearningK. Kawaguchi, L. Kaelbling, and Y. Bengio. (2017)cite arxiv:1710.05468Comment: To appear in Mathematics of Deep Learning, Cambridge University Press. All previous results remain unchanged.Object-Based World Modeling in Semi-Static Environments with Dependent Dirichlet Process Mixtures.L. Wong, T. Kurutach, T. Lozano-Pérez, and L. Kaelbling. IJCAI, page 3513-3521. IJCAI/AAAI Press, (2016)Optimization in the now: Dynamic peephole optimization for hierarchical planning.D. Hadfield-Menell, L. Kaelbling, and T. Lozano-Pérez. ICRA, page 4560-4567. IEEE, (2013)On reinforcement learning for robots.L. Kaelbling. IROS, page 1319-1320. IEEE, (1996)Hierarchical planning for multi-contact non-prehensile manipulation.G. Lee, T. Lozano-Pérez, and L. Kaelbling. IROS, page 264-271. IEEE, (2015)GLIB: Efficient Exploration for Relational Model-Based Reinforcement Learning via Goal-Literal Babbling.R. Chitnis, T. Silver, J. Tenenbaum, L. Kaelbling, and T. Lozano-Pérez. AAAI, page 11782-11791. AAAI Press, (2021)Solving Very Large Weakly Coupled Markov Decision Processes.N. Meuleau, M. Hauskrecht, K. Kim, L. Peshkin, L. Kaelbling, T. Dean, and C. Boutilier. AAAI/IAAI, page 165-172. AAAI Press / The MIT Press, (1998)

BibSonomy

Disambiguation of "Kaelbling, Leslie Pack"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Acting Optimally in Partially Observable Stochastic Domains.

Please choose a person to relate this publication to

Leslie Melters

Leslie Schlüter

Leslie Knolle

Leslie Tramontini

Leslie Heckmann

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Kaelbling, Leslie Pack"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Acting Optimally in Partially Observable Stochastic Domains.

Please choose a person to relate this publication to

Leslie Melters

Leslie Schlüter

Leslie Knolle

Leslie Tramontini

Leslie Heckmann

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Acting Optimally in Partially Observable Stochastic Domains.