Author of the publication

SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding.

ACL (1), pages 5587-5605. Association for Computational Linguistics, (2024)


Other publications of authors with the same name

CleanGen: Mitigating Backdoor Attacks for Generation Tasks in Large Language Models. CoRR, (2024)

POSTER: Identifying and Mitigating Vulnerabilities in LLM-Integrated Applications. AsiaCCS, ACM, (2024)

Brave: Byzantine-Resilient and Privacy-Preserving Peer-to-Peer Federated Learning. CoRR, (2024)

ChatBug: A Common Vulnerability of Aligned LLMs Induced by Chat Templates. CoRR, (2024)

ACE: A Model Poisoning Attack on Contribution Evaluation Methods in Federated Learning. USENIX Security Symposium, USENIX Association, (2024)

Poster: Brave: Byzantine-Resilient and Privacy-Preserving Peer-to-Peer Federated Learning. AsiaCCS, ACM, (2024)

ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs. ACL (1), pages 15157-15173. Association for Computational Linguistics, (2024)

Exact Fault-Tolerant Consensus with Voting Validity. IPDPS, pages 842-852. IEEE, (2023)

Magpie: Alignment Data Synthesis from Scratch by Prompting Aligned LLMs with Nothing. CoRR, (2024)

Identifying and Mitigating Vulnerabilities in LLM-Integrated Applications. CoRR, (2023)