Author of the publication

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Quantile Credit Assignment.

T. Mesnard, W. Chen, A. Saade, Y. Tang, M. Rowland, T. Weber, C. Lyle, A. Gruslys, M. Valko, W. Dabney, G. Ostrovski, E. Moulines, and R. Munos. ICML, volume 202 of Proceedings of Machine Learning Research, page 24517-24531. PMLR, (2023)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

Thomas Thomas

Other publications of authors with the same name

Rlaif: Scaling reinforcement learning from human feedback with ai feedbackH. Lee, S. Phatale, H. Mansoor, K. Lu, T. Mesnard, C. Bishop, V. Carbune, and A. Rastogi. arXiv preprint arXiv:2309.00267, (2023)Counterfactual Credit Assignment in Model-Free Reinforcement Learning.T. Mesnard, T. Weber, F. Viola, S. Thakoor, A. Saade, A. Harutyunyan, W. Dabney, T. Stepleton, N. Heess, A. Guez and 4 other author(s). ICML, volume 139 of Proceedings of Machine Learning Research, page 7654-7664. PMLR, (2021)Credit Assignment in Deep Reinforcement Learning. (Attribution de crédit pour l'apprentissage par renforcement dans des réseaux profonds).T. Mesnard. Polytechnic Institute of Paris, Palaiseau, France, (2023)Extending the Framework of Equilibrium Propagation to General Dynamics.B. Scellier, A. Goyal, J. Binas, T. Mesnard, and Y. Bengio. ICLR (Workshop), OpenReview.net, (2018)Nash Learning from Human Feedback.R. Munos, M. Valko, D. Calandriello, M. Azar, M. Rowland, Z. Guo, Y. Tang, M. Geist, T. Mesnard, C. Fiegel and 8 other author(s). ICML, OpenReview.net, (2024)A Survey of Temporal Credit Assignment in Deep Reinforcement Learning.E. Pignatelli, J. Ferret, M. Geist, T. Mesnard, H. van Hasselt, and L. Toni. Trans. Mach. Learn. Res., (2024)RecurrentGemma: Moving Past Transformers for Efficient Open Language Models.A. Botev, S. De, S. Smith, A. Fernando, G. Muraru, R. Haroun, L. Berrada, R. Pascanu, P. Sessa, R. Dadashi and 52 other author(s). CoRR, (2024)Hindsight Credit Assignment.A. Harutyunyan, W. Dabney, T. Mesnard, M. Azar, B. Piot, N. Heess, H. van Hasselt, G. Wayne, S. Singh, D. Precup and 1 other author(s). NeurIPS, page 12467-12476. (2019)An objective function for STDP.Y. Bengio, T. Mesnard, A. Fischer, S. Zhang, and Y. Wu. CoRR, (2015)Gemma 2: Improving Open Language Models at a Practical Size.M. Rivière, S. Pathak, P. Sessa, C. Hardin, S. Bhupatiraju, L. Hussenot, T. Mesnard, B. Shahriari, A. Ramé, J. Ferret and 89 other author(s). CoRR, (2024)

BibSonomy

Disambiguation of "Mesnard, Thomas"

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Quantile Credit Assignment.

Please choose a person to relate this publication to

Thomas Thomas

Thomas Harasim

Thomas Baron

Thomas Böttcher

Thomas Potinecke

Other publications of authors with the same name

Disambiguation

BibSonomy

Disambiguation of "Mesnard, Thomas"

copydeleteadd this publication to your clipboardcommunity posthistory of this postURLDOIBibTeXEndNoteAPAChicagoDIN 1505HarvardMSOffice XML Quantile Credit Assignment.

Please choose a person to relate this publication to

Thomas Thomas

Thomas Harasim

Thomas Baron

Thomas Böttcher

Thomas Potinecke

Other publications of authors with the same name

Disambiguation

copy delete add this publication to your clipboard
community post
history of this post
URL
DOI
BibTeX
EndNote
APA
Chicago
DIN 1505
Harvard
MSOffice XML

Quantile Credit Assignment.