Article,

Maximum Likelihood Constraint Inference for Inverse Reinforcement Learning

D. Scobee, and S. Sastry.
(2019)cite arxiv:1909.05477Comment: Published as a conference paper at the International Conference on Learning Representations (ICLR), 2020 (at https://openreview.net/forum?id=BJliakStvH ).

Abstract

While most approaches to the problem of Inverse Reinforcement Learning (IRL) focus on estimating a reward function that best explains an expert agent's policy or demonstrated behavior on a control task, it is often the case that such behavior is more succinctly represented by a simple reward combined with a set of hard constraints. In this setting, the agent is attempting to maximize cumulative rewards subject to these given constraints on their behavior. We reformulate the problem of IRL on Markov Decision Processes (MDPs) such that, given a nominal model of the environment and a nominal reward function, we seek to estimate state, action, and feature constraints in the environment that motivate an agent's behavior. Our approach is based on the Maximum Entropy IRL framework, which allows us to reason about the likelihood of an expert agent's demonstrations given our knowledge of an MDP. Using our method, we can infer which constraints can be added to the MDP to most increase the likelihood of observing these demonstrations. We present an algorithm which iteratively infers the Maximum Likelihood Constraint to best explain observed behavior, and we evaluate its efficacy using both simulated behavior and recorded data of humans navigating around an obstacle.

BibTeX key: scobee2019maximum
entry type: article
year: 2019
url: http://arxiv.org/abs/1909.05477
note: cite arxiv:1909.05477Comment: Published as a conference paper at the International Conference on Learning Representations (ICLR), 2020 (at https://openreview.net/forum?id=BJliakStvH )

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

@article{scobee2019maximum, abstract = {While most approaches to the problem of Inverse Reinforcement Learning (IRL) focus on estimating a reward function that best explains an expert agent's policy or demonstrated behavior on a control task, it is often the case that such behavior is more succinctly represented by a simple reward combined with a set of hard constraints. In this setting, the agent is attempting to maximize cumulative rewards subject to these given constraints on their behavior. We reformulate the problem of IRL on Markov Decision Processes (MDPs) such that, given a nominal model of the environment and a nominal reward function, we seek to estimate state, action, and feature constraints in the environment that motivate an agent's behavior. Our approach is based on the Maximum Entropy IRL framework, which allows us to reason about the likelihood of an expert agent's demonstrations given our knowledge of an MDP. Using our method, we can infer which constraints can be added to the MDP to most increase the likelihood of observing these demonstrations. We present an algorithm which iteratively infers the Maximum Likelihood Constraint to best explain observed behavior, and we evaluate its efficacy using both simulated behavior and recorded data of humans navigating around an obstacle.}, added-at = {2020-05-15T14:31:46.000+0200}, author = {Scobee, Dexter R. R. and Sastry, S. Shankar}, biburl = {https://www.bibsonomy.org/bibtex/2592852a79c9d46970480a382984cce90/kirk86}, description = {[1909.05477] Maximum Likelihood Constraint Inference for Inverse Reinforcement Learning}, interhash = {b71a7bc5edbeedc9ad2b6b19438e7aa0}, intrahash = {592852a79c9d46970480a382984cce90}, keywords = {constrains inference inverse reinforcement-learning}, note = {cite arxiv:1909.05477Comment: Published as a conference paper at the International Conference on Learning Representations (ICLR), 2020 (at https://openreview.net/forum?id=BJliakStvH )}, timestamp = {2020-05-15T14:31:46.000+0200}, title = {Maximum Likelihood Constraint Inference for Inverse Reinforcement Learning}, url = {http://arxiv.org/abs/1909.05477}, year = 2019 }

BibSonomy

Maximum Likelihood Constraint Inference for Inverse Reinforcement Learning

Abstract

Tags

Users

Comments and Reviewsshow / hide

Cite this publication

More citation styles

search on