Inproceedings,

Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution.

V. Patil, M. Hofmarcher, M. Dinu, M. Dorfer, P. Blies, J. Brandstetter, J. Arjona-Medina, and S. Hochreiter.
ICML, volume 162 of Proceedings of Machine Learning Research, page 17531-17572. PMLR, (2022)

Meta data

BibTeX key: conf/icml/PatilHDDBBAH22
entry type: inproceedings
booktitle: ICML
year: 2022
pages: 17531-17572
publisher: PMLR
series: Proceedings of Machine Learning Research
volume: 162
crossref: conf/icml/2022
ee: https://proceedings.mlr.press/v162/patil22a.html
url: http://dblp.uni-trier.de/db/conf/icml/icml2022.html#PatilHDDBBAH22

Tags

dblp

Users

Comments and Reviewsshow / hide

Please log in to take part in the discussion (add own reviews or comments).

Cite this publication

search on