Article

AutoDAN: Automatic and Interpretable Adversarial Attacks on Large Language Models.

S. Zhu, R. Zhang, B. An, G. Wu, J. Barrow, Z. Wang, F. Huang, A. Nenkova, and T. Sun.
CoRR, abs/2310.15140 (2023)

Metadata

BibTeX key: journals/corr/abs-2310-15140
entry type: article
year: 2023
journal: CoRR
volume: abs/2310.15140
ee: https://doi.org/10.48550/arXiv.2310.15140
url: http://dblp.uni-trier.de/db/journals/corr/corr2310.html#abs-2310-15140

Tags

dblp
