VPA: Fully Test-Time Visual Prompt Adaptation

Sun, Jiachen; Ibrahim, Mark; Hall, Melissa; Evtimov, Ivan; Mao, Z. Morley; Ferrer, Cristian Canton; Hazirbas, Caner

Computer Science > Computer Vision and Pattern Recognition

arXiv:2309.15251 (cs)

[Submitted on 26 Sep 2023]

Abstract: Textual prompt tuning has demonstrated significant performance improvements in adapting natural language processing models to a variety of downstream tasks by treating hand-engineered prompts as trainable parameters. Inspired by the success of textual prompting, several studies have investigated the efficacy of visual prompt tuning. In this work, we present Visual Prompt Adaptation (VPA), the first framework that generalizes visual prompting with test-time adaptation. VPA introduces a small number of learnable tokens, enabling fully test-time and storage-efficient adaptation without necessitating source-domain information. We examine our VPA design under diverse adaptation settings, encompassing single-image, batched-image, and pseudo-label adaptation. We evaluate VPA on multiple tasks, including out-of-distribution (OOD) generalization, corruption robustness, and domain adaptation. Experimental results reveal that VPA effectively enhances OOD generalization by 3.3% across various models, surpassing previous test-time approaches. Furthermore, we show that VPA improves corruption robustness by 6.5% compared to strong baselines. Finally, we demonstrate that VPA also boosts domain adaptation performance by a relative 5.2%. VPA also exhibits marked effectiveness in improving the robustness of zero-shot recognition for vision-language models.
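The core idea in the abstract (a frozen model plus a few learnable prompt tokens that are updated at test time, with no source-domain data) can be illustrated with a toy sketch. Everything below is an assumption made for illustration and is not taken from the paper: the "model" is a mean-pool over tokens followed by a frozen linear head rather than a vision transformer, the objective is prediction-entropy minimization (a common choice in test-time adaptation, which the abstract does not specify), and the names `predict`, `adapt_prompt`, `W`, `tokens`, and `prompt` are hypothetical.

```python
import math

def softmax(logits):
    # Numerically stable softmax over a list of floats.
    m = max(logits)
    exps = [math.exp(v - m) for v in logits]
    total = sum(exps)
    return [v / total for v in exps]

def entropy(probs):
    # Shannon entropy of a probability vector (small constant avoids log(0)).
    return -sum(p * math.log(p + 1e-12) for p in probs)

def predict(W, tokens, prompt):
    # Toy stand-in for a frozen backbone: append the prompt token to the
    # image tokens, mean-pool, then apply the frozen linear head W.
    seq = tokens + [prompt]
    dim = len(prompt)
    pooled = [sum(t[j] for t in seq) / len(seq) for j in range(dim)]
    logits = [sum(w[j] * pooled[j] for j in range(dim)) for w in W]
    return softmax(logits)

def adapt_prompt(W, tokens, prompt, steps=100, lr=1.0, eps=1e-5):
    # Single-input test-time adaptation: only the prompt is updated, by
    # gradient descent on prediction entropy. Gradients are estimated with
    # central finite differences to keep the sketch dependency-free.
    prompt = list(prompt)  # copy so the caller's prompt is not mutated
    for _ in range(steps):
        grad = []
        for j in range(len(prompt)):
            up = prompt[:]
            up[j] += eps
            down = prompt[:]
            down[j] -= eps
            diff = entropy(predict(W, tokens, up)) - entropy(predict(W, tokens, down))
            grad.append(diff / (2 * eps))
        prompt = [p - lr * g for p, g in zip(prompt, grad)]
    return prompt

# Usage: adapt the prompt for one unlabeled test input.
W = [[1.0, 0.0], [0.0, 1.0]]            # frozen head (hypothetical values)
tokens = [[0.6, 0.4], [0.5, 0.5]]       # "image tokens" (hypothetical values)
adapted = adapt_prompt(W, tokens, [0.0, 0.0])
```

In this sketch the head `W` is never modified and no labels are used; only the prompt vector changes, which is what makes this style of adaptation cheap to store per domain or per input. A real implementation would use a deep-learning framework with automatic differentiation instead of finite differences.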

Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2309.15251 [cs.CV]
	(or arXiv:2309.15251v1 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2309.15251

Submission history

From: Jiachen Sun
[v1] Tue, 26 Sep 2023 20:25:51 UTC (2,354 KB)
