Aug. 1, 2022, 1:11 a.m. | Tejas Srinivasan, Xiang Ren, Jesse Thomason

cs.CL updates on arXiv.org

Aligning image and text encoders from scratch using contrastive learning
requires large amounts of paired image-text data. We alleviate this need by
aligning individually pre-trained language and vision representation models
using a much smaller amount of paired data, augmented with a curriculum
learning algorithm to learn fine-grained vision-language alignments. TOnICS
(Training with Ontology-Informed Contrastive Sampling) initially samples
minibatches whose image-text pairs contain a wide variety of objects to learn
object-level alignment, and progressively samples minibatches where all
image-text pairs contain …
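The sampling idea in the abstract lends itself to a short sketch. The following is a minimal, hypothetical illustration, not the authors' implementation: minibatches are initially drawn uniformly over image-text pairs (a wide variety of objects, so object identity alone suffices for contrastive matching), and as training progresses they are increasingly drawn from a single object class, producing harder in-batch negatives that push the model toward finer-grained alignment. The `object` field, the function names, and the linear schedule are all assumptions for illustration.

```python
import random
from collections import defaultdict


def sample_minibatch(pairs, batch_size, same_object_prob):
    """Sample a minibatch of image-text pairs.

    `pairs` is a list of dicts with keys "image", "caption", and "object"
    (an ontology label for a salient object in the pair). With probability
    `same_object_prob`, every pair in the batch shares one object label
    (hard negatives, finer-grained alignment); otherwise the batch is drawn
    uniformly, giving a wide variety of objects (easy, object-level negatives).
    """
    by_object = defaultdict(list)
    for p in pairs:
        by_object[p["object"]].append(p)

    if random.random() < same_object_prob:
        # Hard regime: all pairs share an object, so the contrastive loss
        # must rely on context beyond object identity.
        eligible = [obj for obj, ps in by_object.items() if len(ps) >= batch_size]
        if eligible:
            obj = random.choice(eligible)
            return random.sample(by_object[obj], batch_size)
    # Easy regime (or fallback): uniform sampling over all pairs.
    return random.sample(pairs, batch_size)


def same_object_prob_schedule(step, total_steps):
    # Hypothetical linear curriculum: start with diverse batches,
    # end with mostly same-object batches.
    return min(1.0, step / total_steps)
```

A training loop would call `same_object_prob_schedule(step, total_steps)` each iteration and pass the result to `sample_minibatch`, so early batches contain many different objects and later batches concentrate on a single one.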

Tags: alignment, arxiv, curriculum, curriculum learning, cv, data, language, learning, vision
