Author of the publication

Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution.

ICML, volume 162 of Proceedings of Machine Learning Research, pages 17531-17572. PMLR, (2022)


Other publications of authors with the same name

The balancing principle for parameter choice in distance-regularized domain adaptation. NeurIPS, pages 20798-20811. (2021)

Large Language Models Can Self-Improve At Web Agent Tasks. CoRR, (2024)

A Dataset Perspective on Offline Reinforcement Learning. CoLLAs, volume 199 of Proceedings of Machine Learning Research, pages 470-517. PMLR, (2022)

Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution. ICML, volume 162 of Proceedings of Machine Learning Research, pages 17531-17572. PMLR, (2022)

Reactive Exploration to Cope With Non-Stationarity in Lifelong Reinforcement Learning. CoLLAs, volume 199 of Proceedings of Machine Learning Research, pages 441-469. PMLR, (2022)

XAI and Strategy Extraction via Reward Redistribution. xxAI@ICML, volume 13200 of Lecture Notes in Computer Science, pages 177-205. Springer, (2020)

Addressing Parameter Choice Issues in Unsupervised Domain Adaptation by Aggregation. ICLR, OpenReview.net, (2023)