See It from My Perspective: Diagnosing the Western Cultural Bias of Large Vision-Language Models in Image Understanding | allainews.com

June 18, 2024, 4:43 a.m. | Amith Ananthram, Elias Stengel-Eskin, Carl Vondrick, Mohit Bansal, Kathleen McKeown

cs.CL updates on arXiv.org arxiv.org

arXiv:2406.11665v1 Announce Type: new
Abstract: Vision-language models (VLMs) can respond to queries about images in many languages. However, beyond language, culture affects how we see things. For example, individuals from Western cultures focus more on the central figure in an image while individuals from Eastern cultures attend more to scene context. In this work, we present a novel investigation that demonstrates and localizes VLMs' Western bias in image understanding. We evaluate large VLMs across subjective and objective visual tasks with …

abstract arxiv beyond bias cs.ai cs.cl cs.cv culture example figure focus however image images language language models languages perspective queries things type understanding vision vision-language vision-language models vlms while

More from arxiv.org / cs.CL updates on arXiv.org

MuTox: Universal MUltilingual Audio-based TOXicity Dataset and Zero-shot Detector 1 day, 7 hours ago | arxiv.org

abstract arxiv audio cs.cl +22

Can Large Language Model Summarizers Adapt to Diverse Scientific Communication Goals? 1 day, 7 hours ago | arxiv.org

abstract adapt arxiv communication +23

ReFT: Reasoning with Reinforced Fine-Tuning 1 day, 7 hours ago | arxiv.org

abstract annotations arxiv capability +22

Deductive Closure Training of Language Models for Coherence, Accuracy, and Updatability 1 day, 7 hours ago | arxiv.org

abstract accuracy arxiv cs.cl +13

Exploring Defeasibility in Causal Reasoning 1 day, 7 hours ago | arxiv.org

abstract arxiv causal causal reasoning +7

Can Large Language Models Follow Concept Annotation Guidelines? A Case Study on Scientific and Financial … 1 day, 7 hours ago | arxiv.org

abstract annotation arxiv capacity +26

Theory of Mind for Multi-Agent Collaboration via Large Language Models 1 day, 7 hours ago | arxiv.org

abstract agent agents arxiv +28

Enhancing Text-based Knowledge Graph Completion with Zero-Shot Large Language Models: A Focus on Semantic Enhancement 1 day, 7 hours ago | arxiv.org

arxiv cs.ai cs.cl focus +12

A Large Language Model Approach to Educational Survey Feedback Analysis 1 day, 7 hours ago | arxiv.org

abstract analysis arxiv capabilities +27

Senior Clinical Data Scientist

@ Novartis | Home Worker

View on ai-jobs.net

R&D Senior Data Scientist 1

@ Jotun | Sandefjord

View on ai-jobs.net

Data Scientist - Corporate Audit, Officer

@ State Street | Toronto, Ontario

View on ai-jobs.net

Senior Manager, Data Science & Analytics Solutions - Safety

@ Hyundai Motor America | Fountain Valley, CA, US, 92708

View on ai-jobs.net

Data Science Working Student (all genders)

@ Merck Group | Darmstadt, Hessen, DE, 64293

View on ai-jobs.net

Senior Data Scientist (m/f/d)

@ BASF | Limburgerhof, DE

View on ai-jobs.net