Author of the publication

Navigating the OverKill in Large Language Models.

, , , , , , , , , and . ACL (1), page 4602-4614. Association for Computational Linguistics, (2024)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

TRACE: A Comprehensive Benchmark for Continual Learning in Large Language Models., , , , , , , , , and 2 other author(s). CoRR, (2023)Linear Alignment: A Closed-form Solution for Aligning Human Preferences without Tuning and Feedback., , , , , , , , , and 2 other author(s). ICML, OpenReview.net, (2024)CausalAPM: Generalizable Literal Disentanglement for NLU Debiasing., , , and . NLPCC (1), volume 15359 of Lecture Notes in Computer Science, page 284-297. Springer, (2024)Secrets of RLHF in Large Language Models Part II: Reward Modeling., , , , , , , , , and 17 other author(s). CoRR, (2024)On the Universal Adversarial Perturbations for Efficient Data-free Adversarial Detection., , , , , and . ACL (Findings), page 13573-13581. Association for Computational Linguistics, (2023)RealBehavior: A Framework for Faithfully Characterizing Foundation Models' Human-like Behavior Mechanisms., , , , , , , , , and . EMNLP (Findings), page 10262-10274. Association for Computational Linguistics, (2023)Self-Polish: Enhance Reasoning in Large Language Models via Problem Refinement., , , , , , , , and . EMNLP (Findings), page 11383-11406. Association for Computational Linguistics, (2023)Chinese Tiny LLM: Pretraining a Chinese-Centric Large Language Model., , , , , , , , , and 6 other author(s). CoRR, (2024)AgentGym: Evolving Large Language Model-based Agents across Diverse Environments., , , , , , , , , and 10 other author(s). CoRR, (2024)LoRAMoE: Alleviating World Knowledge Forgetting in Large Language Models via MoE-Style Plugin., , , , , , , , , and 6 other author(s). ACL (1), page 1932-1945. Association for Computational Linguistics, (2024)