Author of the publication

Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

A Unified View on Solving Objective Mismatch in Model-Based Reinforcement Learning. Trans. Mach. Learn. Res., (2024)
Zephyr: Direct Distillation of LM Alignment. CoRR, (2023)
Entangled Preferences: The History and Risks of Reinforcement Learning and Human Feedback. CoRR, (2023)
Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research. ACL (1), pages 15725-15788. Association for Computational Linguistics, (2024)
Towards a Framework for Openness in Foundation Models: Proceedings from the Columbia Convening on Openness in Artificial Intelligence. CoRR, (2024)
D2PO: Discriminator-Guided DPO with Response Evaluation Models. CoRR, (2024)
Camels in a Changing Climate: Enhancing LM Adaptation with Tulu 2. CoRR, (2023)
Measuring Data. CoRR, (2022)
Position: Social Choice Should Guide AI Alignment in Dealing with Diverse Human Feedback. ICML, OpenReview.net, (2024)
RewardBench: Evaluating Reward Models for Language Modeling. CoRR, (2024)