From post

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

Teaching Large Language Models to Reason with Reinforcement Learning., , , , , , , , и . CoRR, (2024)Understanding the Effects of RLHF on LLM Generalisation and Diversity., , , , , , и . CoRR, (2023)LLaMA: Open and Efficient Foundation Language Models., , , , , , , , , и 4 other автор(ы). CoRR, (2023)Generalization to New Sequential Decision Making Tasks with In-Context Learning., , , , и . ICML, OpenReview.net, (2024)Dungeons and Data: A Large-Scale NetHack Dataset., , , , , , и . NeurIPS, (2022)Know When To Stop: A Study of Semantic Drift in Text Generation., , , и . NAACL-HLT, стр. 3656-3671. Association for Computational Linguistics, (2024)LLaMA: Open and Efficient Foundation Language Models, , , , , , , , , и 4 other автор(ы). CoRR, (2023)Llama: Open and efficient foundation language models, , , , , , , , , и 1 other автор(ы). arXiv preprint arXiv:2302.13971, (2023)Understanding the Effects of RLHF on LLM Generalisation and Diversity., , , , , , и . ICLR, OpenReview.net, (2024)Insights From the NeurIPS 2021 NetHack Challenge., , , , , , , , , и 19 other автор(ы). NeurIPS (Competition and Demos), том 176 из Proceedings of Machine Learning Research, стр. 41-52. PMLR, (2021)