Author of the publication

Training a robot with evaluative feedback and unlabeled guidance signals.

, , and . RO-MAN, page 261-266. IEEE, (2016)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Les systèmes de classeurs.. Rev. d'Intelligence Artif., 21 (1): 75-106 (2007)Social-Task Learning for HRI., , and . ICSR, volume 9388 of Lecture Notes in Computer Science, page 472-481. Springer, (2015)Anticipatory Behavior: Exploiting Knowledge About the Future to Improve Current Behavior., , and . ABiALS, volume 2684 of Lecture Notes in Computer Science, page 1-10. Springer, (2003)TIRL: Enriching Actor-Critic RL with non-expert human teachers and a Trust Model., , and . RO-MAN, page 604-611. IEEE, (2020)Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning., , , , , and . ICML, volume 202 of Proceedings of Machine Learning Research, page 3676-3713. PMLR, (2023)Pragmatically Learning from Pedagogical Demonstrations in Multi-Goal Environments., , and . NeurIPS, (2022)Chi-square Tests Driven Method for Learning the Structure of Factored MDPs., , and . UAI, AUAI Press, (2006)Function Approximation with LWPR and XCSF: a Comparative Study, , , and . Evolutionary Intelligence, 5 (2): 103--116 (2012)A comparative study: function approximation with LWPR and XCSF., , , and . GECCO (Companion), page 1863-1870. ACM, (2010)Unsupervised Learning of Goal Spaces for Intrinsically Motivated Goal Exploration., , , and . ICLR (Poster), OpenReview.net, (2018)