Author of the publication

Value Alignment Verification.

, , , and . ICML, volume 139 of Proceedings of Machine Learning Research, page 1105-1115. PMLR, (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

Learning from Extrapolated Corrections., and . ICRA, page 7034-7040. IEEE, (2019)Offline RL with Observation Histories: Analyzing and Improving Sample Complexity., , and . CoRR, (2023)Confronting Reward Model Overoptimization with Constrained RLHF., , , , , , and . CoRR, (2023)Preference learning along multiple criteria: A game-theoretic perspective., , , , and . CoRR, (2021)Zero-Shot Goal-Directed Dialogue via RL on Imagined Conversations., , and . CoRR, (2023)Do You Want Your Autonomous Car To Drive Like You?, , , , and . CoRR, (2018)Managing AI Risks in an Era of Rapid Progress., , , , , , , , , and 14 other author(s). CoRR, (2023)On the Utility of Learning about Humans for Human-AI Coordination., , , , , , and . NeurIPS, page 5175-5186. (2019)Estimating and Penalizing Induced Preference Shifts in Recommender Systems., , , and . ICML, volume 162 of Proceedings of Machine Learning Research, page 2686-2708. PMLR, (2022)Choice Set Misspecification in Reward Inference., , and . AISafety@IJCAI, volume 2640 of CEUR Workshop Proceedings, CEUR-WS.org, (2020)