Author of the publication

Please choose a person to associate with this publication.

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed. You can also use the button next to the name to display some of the publications already assigned to that person.

 

Other publications of authors with the same name

On Reinforcement Learning and Distribution Matching for Fine-Tuning Language Models with no Catastrophic Forgetting. NeurIPS (2022)

Towards Understanding Sycophancy in Language Models. CoRR (2023)

Catalytic Role Of Noise And Necessity Of Inductive Biases In The Emergence Of Compositional Communication. NeurIPS, pages 23075-23088 (2021)

Inverse Scaling: When Bigger Isn't Better. Trans. Mach. Learn. Res. (2023)

Foundational Challenges in Assuring Alignment and Safety of Large Language Models. CoRR (2024)

Aligning Language Models with Preferences through f-divergence Minimization. ICML, volume 202 of Proceedings of Machine Learning Research, pages 11546-11583. PMLR (2023)

Fine-Tuning Tree-LSTM for Phrase-Level Sentiment Classification on a Polish Dependency Treebank. LCT, volume 12598 of Lecture Notes in Computer Science, pages 31-42. Springer (2017)

Learning from Natural Language Feedback. Trans. Mach. Learn. Res. (2024)

Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback. Trans. Mach. Learn. Res. (2023)

The Reversal Curse: LLMs trained on "A is B" fail to learn "B is A". ICLR, OpenReview.net (2024)