Dec. 28, 2023, 4:22 p.m. | /u/steeveHuang

Deep Learning www.reddit.com

We've just wrapped up a collaborative study with Columbia University and the University of Macau probing the capabilities of Large Vision-Language Models (LVLMs) in understanding and describing charts. The findings are quite startling.

Despite recent advancements, our research reveals that even the most capable LVLMs, such as GPT-4V and Bard, fall short. A striking 🚨**81.27%** (321/395) 🚨 of the captions they generated contained factual errors, misinterpreting the data in the charts. This suggests a significant gap …
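For anyone who wants to sanity-check the headline figure, the percentage follows directly from the counts quoted above. A minimal sketch in Python, using only the two numbers reported in the post:

```python
# Counts taken from the post: 321 of 395 generated captions
# contained at least one factual error.
captions_with_errors = 321
total_captions = 395

error_rate = captions_with_errors / total_captions
print(f"Factual error rate: {error_rate:.2%}")  # -> 81.27%
```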
