June 6, 2024, 4:42 a.m. | Chengyi Cai, Zesheng Ye, Lei Feng, Jianzhong Qi, Feng Liu

arXiv:2406.03150v1 Announce Type: new
Abstract: Visual reprogramming (VR) is a prompting technique that aims to re-purpose a pre-trained model (e.g., a classifier on ImageNet) to target tasks (e.g., medical data prediction) by learning a small-scale pattern added into input images instead of tuning considerable parameters within the model. The location of the pattern within input samples is usually determined by a pre-defined mask shared across all samples. In this paper, we show that the shared mask potentially limits VR's generalization …
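
To make the shared-mask setup concrete, here is a minimal NumPy sketch of a padding-style reprogramming step. It is an illustration under stated assumptions, not the paper's method: the function names `make_border_mask` and `reprogram`, the border-shaped mask, and the centered placement of the target image are all hypothetical choices. In practice the pattern would be learned by gradient descent against a frozen pre-trained model, which is omitted here.

```python
import numpy as np


def make_border_mask(canvas: int, inner: int) -> np.ndarray:
    """Binary mask shared by all samples (assumed layout):
    1 on the border of the canvas, 0 over the centered inner square."""
    mask = np.ones((canvas, canvas, 1), dtype=np.float32)
    start = (canvas - inner) // 2
    mask[start:start + inner, start:start + inner, :] = 0.0
    return mask


def reprogram(image: np.ndarray, pattern: np.ndarray, mask: np.ndarray) -> np.ndarray:
    """Center the target image on a zero canvas and add the masked pattern.

    image:   (inner, inner, channels) target-task image
    pattern: (canvas, canvas, channels) pattern (learnable in a real setup)
    mask:    (canvas, canvas, 1) shared binary mask
    """
    canvas = mask.shape[0]
    inner = image.shape[0]
    start = (canvas - inner) // 2
    padded = np.zeros((canvas, canvas, image.shape[2]), dtype=np.float32)
    padded[start:start + inner, start:start + inner, :] = image
    # The mask decides where the pattern is allowed to appear.
    return padded + mask * pattern


# Toy usage: a 4x4 image placed on an 8x8 canvas with a constant pattern.
image = np.ones((4, 4, 3), dtype=np.float32)
pattern = np.full((8, 8, 3), 0.5, dtype=np.float32)
mask = make_border_mask(canvas=8, inner=4)
reprogrammed = reprogram(image, pattern, mask)
```

Because the same `mask` is applied to every sample, the pattern occupies the same region of every input; this is the shared-mask design the abstract describes.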


