Learning to Generate Image Source-Agnostic Universal Adversarial Perturbations

Zhao, Pu; Ram, Parikshit; Lu, Songtao; Yao, Yuguang; Bouneffouf, Djallel; Lin, Xue; Liu, Sijia

Computer Science > Machine Learning

arXiv:2009.13714 (cs)

[Submitted on 29 Sep 2020 (v1), last revised 17 Aug 2022 (this version, v4)]

Title:Learning to Generate Image Source-Agnostic Universal Adversarial Perturbations

Authors:Pu Zhao, Parikshit Ram, Songtao Lu, Yuguang Yao, Djallel Bouneffouf, Xue Lin, Sijia Liu

View PDF

Abstract:Adversarial perturbations are critical for certifying the robustness of deep learning models. A universal adversarial perturbation (UAP) can simultaneously attack multiple images, and thus offers a more unified threat model, obviating an image-wise attack algorithm. However, the existing UAP generator is underdeveloped when images are drawn from different image sources (e.g., with different image resolutions). Towards an authentic universality across image sources, we take a novel view of UAP generation as a customized instance of few-shot learning, which leverages bilevel optimization and learning-to-optimize (L2O) techniques for UAP generation with improved attack success rate (ASR). We begin by considering the popular model agnostic meta-learning (MAML) framework to meta-learn a UAP generator. However, we see that the MAML framework does not directly offer the universal attack across image sources, requiring us to integrate it with another meta-learning framework of L2O. The resulting scheme for meta-learning a UAP generator (i) has better performance (50% higher ASR) than baselines such as Projected Gradient Descent, (ii) has better performance (37% faster) than the vanilla L2O and MAML frameworks (when applicable), and (iii) is able to simultaneously handle UAP generation for different victim models and image data sources.

Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI); Computer Vision and Pattern Recognition (cs.CV); Machine Learning (stat.ML)
Cite as:	arXiv:2009.13714 [cs.LG]
	(or arXiv:2009.13714v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2009.13714

Submission history

From: Pu Zhao [view email]
[v1] Tue, 29 Sep 2020 01:23:20 UTC (1,479 KB)
[v2] Tue, 20 Oct 2020 00:47:28 UTC (1,508 KB)
[v3] Mon, 10 May 2021 17:42:57 UTC (4,187 KB)
[v4] Wed, 17 Aug 2022 23:00:11 UTC (4,239 KB)

Computer Science > Machine Learning

Title:Learning to Generate Image Source-Agnostic Universal Adversarial Perturbations

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Learning to Generate Image Source-Agnostic Universal Adversarial Perturbations

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators