Computer Science > Computer Vision and Pattern Recognition

arXiv:2403.13282 (cs)

[Submitted on 20 Mar 2024 (v1), last revised 14 Jun 2024 (this version, v2)]

Title:AdaViPro: Region-based Adaptive Visual Prompt for Large-Scale Models Adapting

Authors:Mengyu Yang, Ye Tian, Lanshan Zhang, Xiao Liang, Xuming Ran, Wendong Wang

Abstract:Recently, prompt-based methods have emerged as a new alternative `parameter-efficient fine-tuning' paradigm, which only fine-tunes a small number of additional parameters while keeping the original model frozen. However, despite achieving notable results, existing prompt methods mainly focus on `what to add', while overlooking the equally important aspect of `where to add', typically relying on the manually crafted placement. To this end, we propose a region-based Adaptive Visual Prompt, named AdaViPro, which integrates the `where to add' optimization of the prompt into the learning process. Specifically, we reconceptualize the `where to add' optimization as a problem of regional decision-making. During inference, AdaViPro generates a regionalized mask map for the whole image, which is composed of 0 and 1, to designate whether to apply or discard the prompt in each specific area. Therefore, we employ Gumbel-Softmax sampling to enable AdaViPro's end-to-end learning through standard back-propagation. Extensive experiments demonstrate that our AdaViPro yields new efficiency and accuracy trade-offs for adapting pre-trained models.

Comments:	Accepted by ICIP 2024
Subjects:	Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2403.13282 [cs.CV]
	(or arXiv:2403.13282v2 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2403.13282

Submission history

From: Mengyu Yang [view email]
[v1] Wed, 20 Mar 2024 03:47:53 UTC (3,837 KB)
[v2] Fri, 14 Jun 2024 07:00:30 UTC (3,837 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:AdaViPro: Region-based Adaptive Visual Prompt for Large-Scale Models Adapting

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:AdaViPro: Region-based Adaptive Visual Prompt for Large-Scale Models Adapting

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators