Author of the publication


Training a robot with evaluative feedback and unlabeled guidance signals.

A. Najar, O. Sigaud, and M. Chetouani. RO-MAN, page 261-266. IEEE, (2016)

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to a name to display some of the publications already assigned to that person.

Olivier Coutand

Paul Olivier

Christoph Olivier

Olivier Brede

Hans Olivier

Other publications of authors with the same name

Les systèmes de classeurs. O. Sigaud. Rev. d'Intelligence Artif., 21 (1): 75-106 (2007)

Social-Task Learning for HRI. A. Najar, O. Sigaud, and M. Chetouani. ICSR, volume 9388 of Lecture Notes in Computer Science, page 472-481. Springer, (2015)

Anticipatory Behavior: Exploiting Knowledge About the Future to Improve Current Behavior. M. Butz, O. Sigaud, and P. Gérard. ABiALS, volume 2684 of Lecture Notes in Computer Science, page 1-10. Springer, (2003)

TIRL: Enriching Actor-Critic RL with non-expert human teachers and a Trust Model. F. Rutard, O. Sigaud, and M. Chetouani. RO-MAN, page 604-611. IEEE, (2020)

Grounding Large Language Models in Interactive Environments with Online Reinforcement Learning. T. Carta, C. Romac, T. Wolf, S. Lamprier, O. Sigaud, and P. Oudeyer. ICML, volume 202 of Proceedings of Machine Learning Research, page 3676-3713. PMLR, (2023)

Pragmatically Learning from Pedagogical Demonstrations in Multi-Goal Environments. H. Caselles-Dupré, O. Sigaud, and M. Chetouani. NeurIPS, (2022)

Chi-square Tests Driven Method for Learning the Structure of Factored MDPs. T. Degris, O. Sigaud, and P. Wuillemin. UAI, AUAI Press, (2006)

Function Approximation with LWPR and XCSF: a Comparative Study. P. Stalph, J. Rubinsztajn, O. Sigaud, and M. Butz. Evolutionary Intelligence, 5 (2): 103-116 (2012)

A comparative study: function approximation with LWPR and XCSF. P. Stalph, J. Rubinsztajn, O. Sigaud, and M. Butz. GECCO (Companion), page 1863-1870. ACM, (2010)

Unsupervised Learning of Goal Spaces for Intrinsically Motivated Goal Exploration. A. Péré, S. Forestier, O. Sigaud, and P. Oudeyer. ICLR (Poster), OpenReview.net, (2018)

BibSonomy

Disambiguation of "Sigaud, Olivier"


