
Please choose a person to relate this publication to

To distinguish between persons with the same name, the academic degree and the title of an important publication are displayed.

 

Other publications by persons with the same name

On Reinforcement Learning and Distribution Matching for Fine-Tuning Language Models with no Catastrophic Forgetting., , , and . NeurIPS, (2022)

RL with KL penalties is better viewed as Bayesian inference., , and . EMNLP (Findings), pp. 1083-1091. Association for Computational Linguistics, (2022)

Controlling Conditional Language Models without Catastrophic Forgetting., , , and . ICML, vol. 162 of Proceedings of Machine Learning Research, pp. 11499-11528. PMLR, (2022)

The Emergence of Action-grounded Compositional Communication., , , , , and . CogSci, cognitivesciencesociety.org, (2020)

Fine-Tuning Tree-LSTM for Phrase-Level Sentiment Classification on a Polish Dependency Treebank., and . LCT, vol. 12598 of Lecture Notes in Computer Science, pp. 31-42. Springer, (2017)

Aligning Language Models with Preferences through f-divergence Minimization., , , , , and . ICML, vol. 202 of Proceedings of Machine Learning Research, pp. 11546-11583. PMLR, (2023)

Towards Understanding Sycophancy in Language Models., , , , , , , , , and 9 other author(s). CoRR, (2023)

Catalytic Role Of Noise And Necessity Of Inductive Biases In The Emergence Of Compositional Communication., , , and . NeurIPS, pp. 23075-23088. (2021)

Foundational Challenges in Assuring Alignment and Safety of Large Language Models., , , , , , , , , and 28 other author(s). CoRR, (2024)

Energy-Based Models for Code Generation under Compilability Constraints., , , and . CoRR, (2021)