Abstract
Recent work on adversarial attacks has shown that the Projected Gradient Descent
(PGD) adversary is a universal first-order adversary, and that classifiers
adversarially trained with PGD are robust against a wide range of first-order
attacks. It is worth noting that the original objective of an attack/defense
model relies on a data distribution $p(\mathbf{x})$, typically in the form of
risk maximization/minimization, e.g.,
$\max/\min \, \mathbb{E}_{p(\mathbf{x})} L(\mathbf{x})$, where
$p(\mathbf{x})$ is some unknown data distribution and $L(\cdot)$ is a loss
function.
function. However, since PGD generates attack samples independently for each
data sample based on $L(\cdot)$, the procedure does not necessarily
lead to good generalization in terms of risk optimization. In this paper, we
achieve the goal by proposing distributionally adversarial attack (DAA), a
framework to solve an optimal adversarial-data distribution, a perturbed
distribution that satisfies the $L_ınfty$ constraint but deviates from the
original data distribution to increase the generalization risk maximally.
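As a rough sketch in our own notation (not necessarily the paper's exact
formulation), this distribution-level objective can be written as
\[
  \max_{q \in \mathcal{B}_\infty(p, \epsilon)} \; \mathbb{E}_{q(\mathbf{x})} L(\mathbf{x}),
\]
where $\mathcal{B}_\infty(p, \epsilon)$ is shorthand for the set of
distributions obtained by perturbing samples from $p$ within an $L_\infty$
ball of radius $\epsilon$.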
Algorithmically, DAA performs optimization on the space of potential data
distributions, which introduces direct dependencies among all data points when
generating adversarial samples. DAA is evaluated by attacking state-of-the-art
defense models, including the adversarially trained models provided by MIT
MadryLab. Notably, DAA ranks first on MadryLab's white-box
leaderboards, reducing the accuracy of their secret MNIST model to $88.79\%$
(with $l_\infty$ perturbations of $\epsilon = 0.3$) and the accuracy of their
secret CIFAR model to $44.71\%$ (with $l_\infty$ perturbations of $\epsilon =
8.0$). Code for the experiments is released at
https://github.com/tianzheng4/Distributionally-Adversarial-Attack.
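For contrast with DAA's distribution-level coupling, the following is a
minimal per-sample PGD sketch in PyTorch (our illustration, not the released
DAA code); the hyperparameter names `epsilon`, `step_size`, and `num_steps`
are illustrative assumptions rather than values taken from the paper. Each
example is perturbed independently, which is exactly the per-sample behavior
DAA replaces with an optimization over the data distribution.

```python
# Minimal per-sample PGD sketch under an L-infinity constraint (illustrative only).
import torch
import torch.nn.functional as F

def pgd_attack(model, x, y, epsilon=0.3, step_size=0.01, num_steps=40):
    """Maximize the per-sample loss, projecting each iterate back into the
    L-infinity ball of radius epsilon around the clean input x."""
    x_adv = x.clone().detach()
    # Random start inside the epsilon-ball, as in standard PGD.
    x_adv = x_adv + torch.empty_like(x_adv).uniform_(-epsilon, epsilon)
    x_adv = torch.clamp(x_adv, 0.0, 1.0)
    for _ in range(num_steps):
        x_adv.requires_grad_(True)
        loss = F.cross_entropy(model(x_adv), y)
        grad = torch.autograd.grad(loss, x_adv)[0]
        # Ascend the per-sample loss; note there is no coupling across examples here.
        x_adv = x_adv.detach() + step_size * grad.sign()
        # Project onto the L-infinity constraint set and the valid pixel range.
        x_adv = torch.min(torch.max(x_adv, x - epsilon), x + epsilon)
        x_adv = torch.clamp(x_adv, 0.0, 1.0)
    return x_adv.detach()
```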