Author of the publication

FastIF: Scalable Influence Functions for Efficient Model Interpretation and Debugging.

, , , , and . EMNLP (1), page 10333-10350. Association for Computational Linguistics, (2021)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

The Out-of-Distribution Problem in Explainability and Search Methods for Feature Importance Explanations., , and . NeurIPS, page 3650-3666. (2021)Summarization Programs: Interpretable Abstractive Summarization with Neural Modular Trees., , , and . ICLR, OpenReview.net, (2023)Are Hard Examples also Harder to Explain? A Study with Human and Model-Generated Explanations., , , and . EMNLP, page 2121-2131. Association for Computational Linguistics, (2022)Fundamental Problems With Model Editing: How Should Rational Belief Revision Work in LLMs?, , , , and . CoRR, (2024)Can Sensitive Information Be Deleted From LLMs? Objectives for Defending Against Extraction Attacks., , and . ICLR, OpenReview.net, (2024)Methods for Measuring, Updating, and Visualizing Factual Beliefs in Language Models., , , , , , , and . EACL, page 2706-2723. Association for Computational Linguistics, (2023)Evaluating Explainable AI: Which Algorithmic Explanations Help Users Predict Model Behavior?, and . ACL, page 5540-5552. Association for Computational Linguistics, (2020)LACIE: Listener-Aware Finetuning for Confidence Calibration in Large Language Models., , and . CoRR, (2024)Open Problems and Fundamental Limitations of Reinforcement Learning from Human Feedback., , , , , , , , , and 22 other author(s). Trans. Mach. Learn. Res., (2023)Rethinking Machine Unlearning for Large Language Models., , , , , , , , , and 3 other author(s). CoRR, (2024)