[R] No “Zero-Shot” Without Exponential Data: Pretraining Concept Frequency Determines Multimodal Model Performance | allainews.com

April 9, 2024, 4:27 a.m. | /u/quequero

Machine Learning www.reddit.com

**Abstract**

>Web-crawled pretraining datasets underlie the impressive "zero-shot" evaluation performance of multimodal models, such as CLIP for classification/retrieval and Stable-Diffusion for image generation. However, it is unclear how meaningful the notion of "zero-shot" generalization is for such multimodal models, as it is not known to what extent their pretraining datasets encompass the downstream concepts targeted for during "zero-shot" evaluation. In this work, we ask: How is the performance of multimodal models on downstream concepts influenced by the frequency of these …

abstract classification clip concept data datasets diffusion evaluation however image image generation machinelearning multimodal multimodal model multimodal models notion performance pretraining retrieval web zero-shot

More from www.reddit.com / Machine Learning

[P] Identify toxic underwater air bubbles lurking in the substrate with aquatic ultrasonic scans via … 2 hours ago | www.reddit.com

arduino classification color identify +11

[P] YARI - Yet Another RAG Implementation. Hybrid context retrieval 3 hours ago | www.reddit.com

api context cosine embedding +14

[D] Is EOS token crucial during pre-training? 6 hours ago | www.reddit.com

documents eos flow information +7

[D] Stack Overflow partnership with OPEN AI 8 hours ago | www.reddit.com

access chart chat chat gpt +16

[D] How does fast inference work with state of the art LLMs? 10 hours ago | www.reddit.com

70b art gpt gpt-4 +11

[D] Llama 3 Monstrosities 1 day, 1 hour ago | www.reddit.com

create easy life llama +4

[D] Get paid for peer reviews on ResearchHub 1 day, 5 hours ago | www.reddit.com

cryptocurrency editor machinelearning mind +6

[D] NER for large text data 1 day, 5 hours ago | www.reddit.com

billion data data scientist hello +8

[P] Table Extraction , Text Extraction 1 day, 6 hours ago | www.reddit.com

block column dataset design +13

Founding AI Engineer, Agents

@ Occam AI | New York

View on ai-jobs.net

AI Engineer Intern, Agents

@ Occam AI | US

View on ai-jobs.net

AI Research Scientist

@ Vara | Berlin, Germany and Remote

View on ai-jobs.net

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead Data Engineer

@ WorkMoney | New York City, United States - Remote

View on ai-jobs.net