Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution.

V. Patil, M. Hofmarcher, M. Dinu, M. Dorfer, P. Blies, J. Brandstetter, J. Arjona-Medina, and S. Hochreiter. ICML, volume 162 of Proceedings of Machine Learning Research, page 17531-17572. PMLR, (2022)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Dinu Costa

Claudia Elena Dinu

Georgiana Dinu

Valeriu Dinu

Georgiana Dinu

Other publications of authors with the same name

The balancing principle for parameter choice in distance-regularized domain adaptation.W. Zellinger, N. Shepeleva, M. Dinu, H. Eghbal zadeh, H. Nguyen, B. Nessler, S. Pereverzyev, and B. Moser. NeurIPS, page 20798-20811. (2021)Large Language Models Can Self-Improve At Web Agent Tasks.A. Patel, M. Hofmarcher, C. Leoveanu-Condrei, M. Dinu, C. Callison-Burch, and S. Hochreiter. CoRR, (2024)A Dataset Perspective on Offline Reinforcement Learning.K. Schweighofer, M. Dinu, A. Radler, M. Hofmarcher, V. Patil, A. Bitto-Nemling, H. Eghbal zadeh, and S. Hochreiter. CoLLAs, volume 199 of Proceedings of Machine Learning Research, page 470-517. PMLR, (2022)Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution.V. Patil, M. Hofmarcher, M. Dinu, M. Dorfer, P. Blies, J. Brandstetter, J. Arjona-Medina, and S. Hochreiter. ICML, volume 162 of Proceedings of Machine Learning Research, page 17531-17572. PMLR, (2022)Reactive Exploration to Cope With Non-Stationarity in Lifelong Reinforcement Learning.C. Steinparz, T. Schmied, F. Paischer, M. Dinu, V. Patil, A. Bitto-Nemling, H. Eghbal zadeh, and S. Hochreiter. CoLLAs, volume 199 of Proceedings of Machine Learning Research, page 441-469. PMLR, (2022)XAI and Strategy Extraction via Reward Redistribution.M. Dinu, M. Hofmarcher, V. Patil, M. Dorfer, P. Blies, J. Brandstetter, J. Arjona-Medina, and S. Hochreiter. xxAI@ICML, volume 13200 of Lecture Notes in Computer Science, page 177-205. Springer, (2020)Addressing Parameter Choice Issues in Unsupervised Domain Adaptation by Aggregation.M. Dinu, M. Holzleitner, M. Beck, H. Nguyen, A. Huber, H. Eghbal zadeh, B. Moser, S. Pereverzyev, S. Hochreiter, and W. Zellinger. ICLR, OpenReview.net, (2023)

BibSonomy

Disambiguation of "Dinu, Marius-Constantin"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution.

Please choose a person to relate this publication to

Dinu Costa

Claudia Elena Dinu

Georgiana Dinu

Valeriu Dinu

Georgiana Dinu

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Dinu, Marius-Constantin"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution.

Please choose a person to relate this publication to

Dinu Costa

Claudia Elena Dinu

Georgiana Dinu

Valeriu Dinu

Georgiana Dinu

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution.