Author of the publication

Constrained Markov Decision Processes via Backward Value Functions.

ICML, volume 119 of Proceedings of Machine Learning Research, pages 8502-8511. PMLR, (2020)

Other publications of authors with the same name

Online Planning Algorithms for POMDPs. J. Artif. Intell. Res., (2008)

A Hierarchical Latent Variable Encoder-Decoder Model for Generating Dialogues. (2016) arxiv:1605.06069. Comment: 15 pages, 5 tables, 4 figures.

Deep Reinforcement Learning that Matters. (2017) arxiv:1709.06560. Comment: Accepted to the Thirty-Second AAAI Conference On Artificial Intelligence (AAAI), 2018.

Multi-tasking SLAM. ICRA, pages 377-384. IEEE, (2010)

Designing Intelligent Wheelchairs: Reintegrating AI. AAAI Spring Symposium: Designing Intelligent Robots, volume SS-13-04 of AAAI Technical Report, AAAI, (2013)

Deep Reinforcement Learning That Matters. (2018, 2017)

Mobility profile and wheelchair driving skills of powered wheelchair users: Sensor-based event recognition using a support vector machine classifier. EMBC, pages 7336-7339. IEEE, (2011)

Learning time series models for pedestrian motion prediction. ICRA, pages 3323-3330. IEEE, (2016)

Modeling Glucagon Action in Patients With Type 1 Diabetes. IEEE J. Biomed. Health Informatics, 21 (4): 1163-1171 (2017)

On Overfitting and Asymptotic Bias in Batch Reinforcement Learning with Partial Observability (Extended Abstract). IJCAI, pages 5055-5059. ijcai.org, (2020) Journal track.