Non-Markovian Policies in Sequential Decision Problems

Abstract

In this article we prove the validity of the Bellman Optimality Equation and related results for sequential decision problems with a general recursive structure. The characteristic feature of our approach is that also non-Markovian policies are taken into account. The theory is motivated by some experiments with a learning robot.

BibTeX key: szepesvari1998a
entry type: article
year: 1998
journal: Acta Cybernetica
number: 3
pages: 305--318
volume: 13
pdf: papers/accyb97.ps.pdf
date-modified: 2010-09-02 13:09:16 -0600
date-added: 2010-08-28 17:38:14 -0600

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

BibSonomy

Non-Markovian Policies in Sequential Decision Problems

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on