Author of the publication

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

RLAIF: Scaling reinforcement learning from human feedback with AI feedback. arXiv preprint arXiv:2309.00267, (2023)
Counterfactual Credit Assignment in Model-Free Reinforcement Learning. ICML, volume 139 of Proceedings of Machine Learning Research, page 7654-7664. PMLR, (2021)
Credit Assignment in Deep Reinforcement Learning. (Attribution de crédit pour l'apprentissage par renforcement dans des réseaux profonds). Polytechnic Institute of Paris, Palaiseau, France, (2023)
Extending the Framework of Equilibrium Propagation to General Dynamics. ICLR (Workshop), OpenReview.net, (2018)
Nash Learning from Human Feedback. ICML, OpenReview.net, (2024)
A Survey of Temporal Credit Assignment in Deep Reinforcement Learning. Trans. Mach. Learn. Res., (2024)
RecurrentGemma: Moving Past Transformers for Efficient Open Language Models. CoRR, (2024)
Hindsight Credit Assignment. NeurIPS, page 12467-12476. (2019)
An objective function for STDP. CoRR, (2015)
Gemma 2: Improving Open Language Models at a Practical Size. CoRR, (2024)