From post

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed.

 

Другие публикации лиц с тем же именем

A General Language Assistant as a Laboratory for Alignment., , , , , , , , , и 12 other автор(ы). CoRR, (2021)Red Teaming Language Models to Reduce Harms: Methods, Scaling Behaviors, and Lessons Learned., , , , , , , , , и 26 other автор(ы). CoRR, (2022)Predictability and Surprise in Large Generative Models., , , , , , , , , и 20 other автор(ы). CoRR, (2022)Sleeper Agents: Training Deceptive LLMs that Persist Through Safety Training., , , , , , , , , и 29 other автор(ы). CoRR, (2024)Specific versus General Principles for Constitutional AI., , , , , , , , , и 26 other автор(ы). CoRR, (2023)In-context Learning and Induction Heads., , , , , , , , , и 16 other автор(ы). CoRR, (2022)Discovering Language Model Behaviors with Model-Written Evaluations., , , , , , , , , и 53 other автор(ы). ACL (Findings), стр. 13387-13434. Association for Computational Linguistics, (2023)Constitutional AI: Harmlessness from AI Feedback., , , , , , , , , и 41 other автор(ы). CoRR, (2022)Discovering Language Model Behaviors with Model-Written Evaluations., , , , , , , , , и 53 other автор(ы). CoRR, (2022)Predictability and Surprise in Large Generative Models., , , , , , , , , и 20 other автор(ы). FAccT, стр. 1747-1764. ACM, (2022)