Author of the publication

SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding.

, , , , , and . ACL (1), page 5587-5605. Association for Computational Linguistics, (2024)

Please choose a person to relate this publication to

To differ between persons with the same name, the academic degree and the title of an important publication will be displayed. You can also use the button next to the name to display some publications already assigned to the person.

 

Other publications of authors with the same name

LDL: A Defense for Label-Based Membership Inference Attacks., , , , and . AsiaCCS, page 95-108. ACM, (2023)Scalable Planning in Multi-Agent MDPs., , , and . CDC, page 5932-5939. IEEE, (2021)A Submodular Energy Function Approach to Controlled Islanding with Provable Stability., , , and . CDC, page 7635-7642. IEEE, (2023)A Compositional Resilience Index for Computationally Efficient Safety Analysis of Interconnected Systems., , , , and . CDC, page 7554-7561. IEEE, (2023)Privacy-Preserving Resilience of Cyber-Physical Systems to Adversaries., , , , and . CDC, page 3785-3792. IEEE, (2020)A Submodular Optimization Approach to Stable and Minimally Disruptive Controlled Islanding in Power Systems., , , and . ACC, page 4587-4594. IEEE, (2022)CleanGen: Mitigating Backdoor Attacks for Generation Tasks in Large Language Models., , , , , , and . CoRR, (2024)POSTER: Game of Trojans: Adaptive Adversaries Against Output-based Trojaned-Model Detectors., , , , , , and . AsiaCCS, ACM, (2024)FedGame: A Game-Theoretic Defense against Backdoor Attacks in Federated Learning., , , , , , , and . NeurIPS, (2023)Learning Dissemination Strategies for External Sources in Opinion Dynamic Models with Cognitive Biases., , , , and . IJCAI, page 3-11. ijcai.org, (2023)