An Inverse Scaling Law for CLIP Training. (arXiv:2305.07017v1 [cs.CV])
cs.CV updates on arXiv.org
CLIP, the first foundation model that connects images and text, has enabled
many recent breakthroughs in computer vision. However, its associated training
cost is prohibitively high, imposing a significant barrier to its widespread
exploration. In this paper, we present a surprising finding that there exists
an inverse scaling law for CLIP training, whereby the larger the image/text
encoders used, the shorter the sequence length of image/text tokens that can be
applied in training. Moreover, we showcase that the strategy for …
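The inverse scaling law above implies a practical trade-off: when training a larger image/text encoder, the number of input tokens per sample can be reduced without losing proportionate quality. The abstract does not specify the reduction strategy, so the sketch below uses random patch-token masking purely as one plausible illustration (the `shorten_image_tokens` helper and the keep ratios are hypothetical, not from the paper):

```python
import numpy as np

def shorten_image_tokens(patch_tokens, keep_ratio, rng):
    """Randomly keep a fraction of image patch tokens.

    Hypothetical token-reduction step: sample `keep_ratio` of the
    tokens without replacement, preserving their original order.
    """
    n = patch_tokens.shape[0]
    n_keep = max(1, int(n * keep_ratio))
    idx = rng.choice(n, size=n_keep, replace=False)
    return patch_tokens[np.sort(idx)]

rng = np.random.default_rng(0)
# A ViT-style image: 14x14 = 196 patch tokens of dimension 768.
tokens = rng.normal(size=(196, 768))

# Illustrative pairing: a larger encoder tolerates a smaller keep ratio,
# shortening the sequence (and thus the attention cost) during training.
short = shorten_image_tokens(tokens, keep_ratio=0.25, rng=rng)
print(short.shape)  # (49, 768)
```

Since self-attention cost grows quadratically with sequence length, keeping 25% of the tokens cuts the attention FLOPs for that layer by roughly 16x, which is the kind of saving that could offset a larger encoder's per-token cost.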