May 22, 2024, 4:47 a.m. | Rochelle Choenni, Dan Garrette, Ekaterina Shutova

cs.CL updates on arXiv.org

arXiv:2305.13286v2 Announce Type: replace
Abstract: Multilingual large language models (MLLMs) are jointly trained on data from many different languages such that representation of individual languages can benefit from other languages' data. Impressive performance on zero-shot cross-lingual transfer shows that these models are capable of exploiting data from other languages. Yet, it remains unclear to what extent, and under which conditions, languages rely on each other's data. In this study, we use TracIn (Pruthi et al., 2020), a training data attribution …
