all AI news
CPT: Colorful Prompt Tuning for Pre-trained Vision-Language Models. (arXiv:2109.11797v3 [cs.CV] UPDATED)
May 23, 2022, 1:12 a.m. | Yuan Yao, Ao Zhang, Zhengyan Zhang, Zhiyuan Liu, Tat-Seng Chua, Maosong Sun
cs.CV updates on arXiv.org arxiv.org
Pre-Trained Vision-Language Models (VL-PTMs) have shown promising
capabilities in grounding natural language in image data, facilitating a broad
variety of cross-modal tasks. However, we note that there exists a significant
gap between the objective forms of model pre-training and fine-tuning,
resulting in a need for large amounts of labeled data to stimulate the visual
grounding capability of VL-PTMs for downstream tasks. To address the challenge,
we present Cross-modal Prompt Tuning (CPT, alternatively, Colorful Prompt
Tuning), a novel paradigm for tuning …
More from arxiv.org / cs.CV updates on arXiv.org
Jobs in AI, ML, Big Data
Senior ML Researcher - 3D Geometry Processing | 3D Shape Generation | 3D Mesh Data
@ Promaton | Europe
Research Assistant/Associate, Health Data Science [LKCMedicine]
@ Nanyang Technological University | NTU Novena Campus, Singapore
Senior Machine Learning Engineer, Portfolio ML
@ Affirm | Remote Canada
[Sessional Lecturer] Foundations of Data Analytics and Machine Learning - APS1070
@ University of Toronto | Toronto, ON, CA
Senior Data Scientist
@ Prosper | United States
Data Analyst
@ ZF Friedrichshafen AG | Coimbatore, TN, IN, 641659