Feb. 14, 2024, 5:41 a.m. | Yun-Da Tsai, Ting-Yu Yen, Pei-Fu Guo, Zhe-Yan Li, Shou-De Lin

cs.LG updates on arXiv.org

This paper addresses the challenge of modality mismatch in multimodal learning, where the modalities available at inference differ from those available at training. We propose Text-centric Alignment for Multi-Modality Learning (TAMML), a method that uses Large Language Models (LLMs) with in-context learning, together with foundation models, to improve the generalizability of multimodal systems under these conditions. By leveraging text as a unified semantic space, TAMML demonstrates significant improvements in handling unseen, diverse, and …
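The core idea — mapping every available modality into text so that mismatched modality sets at training and inference share one representation — can be sketched as below. This is a minimal illustration, not the paper's implementation: the converter functions, prompt format, and field names are all assumptions, and in a real TAMML-style pipeline the image converter would be a captioning foundation model rather than a stub.

```python
# Sketch of text-centric alignment: convert each modality to text,
# then concatenate into a single prompt for an LLM.
# All function names and formats here are illustrative assumptions.

def image_to_text(image_meta: dict) -> str:
    # Stand-in for a captioning foundation model; here we just
    # summarize whatever metadata is available.
    return f"An image of {image_meta.get('label', 'unknown content')}."

def tabular_to_text(row: dict) -> str:
    # Serialize structured features as natural-language key-value pairs.
    return "; ".join(f"{k} is {v}" for k, v in row.items())

def build_prompt(modalities: dict) -> str:
    # Only the modalities actually present are converted, so the same
    # pipeline tolerates a different modality set at inference time.
    parts = []
    if "image" in modalities:
        parts.append(image_to_text(modalities["image"]))
    if "table" in modalities:
        parts.append(tabular_to_text(modalities["table"]))
    if "text" in modalities:
        parts.append(modalities["text"])
    return " ".join(parts)

# Inference-time input missing the "text" modality seen in training:
prompt = build_prompt({
    "image": {"label": "a red sedan"},
    "table": {"year": 2019, "mileage": "40k miles"},
})
```

Because every modality lands in the same textual space, the downstream LLM consumes `prompt` uniformly regardless of which modalities were present.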

