CMMMU: A Chinese Massive Multi-discipline Multimodal Understanding Benchmark | allainews.com

March 19, 2024, 4:51 a.m. | Ge Zhang, Xinrun Du, Bei Chen, Yiming Liang, Tongxu Luo, Tianyu Zheng, Kang Zhu, Yuyang Cheng, Chunpu Xu, Shuyue Guo, Haoran Zhang, Xingwei Qu, Junjie

cs.CV updates on arXiv.org arxiv.org

arXiv:2401.11944v2 Announce Type: replace-cross
Abstract: As the capabilities of large multimodal models (LMMs) continue to advance, evaluating the performance of LMMs emerges as an increasing need. Additionally, there is an even larger gap in evaluating the advanced knowledge and reasoning abilities of LMMs in non-English contexts such as Chinese. We introduce CMMMU, a new Chinese Massive Multi-discipline Multimodal Understanding benchmark designed to evaluate LMMs on tasks demanding college-level subject knowledge and deliberate reasoning in a Chinese context. CMMMU is inspired …

abstract advance advanced arxiv benchmark capabilities chinese cs.ai cs.cl cs.cv english gap knowledge large multimodal models lmms massive multimodal multimodal models performance reasoning type understanding

More from arxiv.org / cs.CV updates on arXiv.org

GPT-4V(ision) for Robotics: Multimodal Task Planning from Human Demonstration 4 hours ago | arxiv.org

abstract arxiv cs.cl cs.cv +25

Dynamic Open Vocabulary Enhanced Safe-landing with Intelligence (DOVESEI) 4 hours ago | arxiv.org

abstract arxiv attention cs.ai +16

CoVid-19 Detection leveraging Vision Transformers and Explainable AI 4 hours ago | arxiv.org

abstract arxiv covid covid-19 +19

SAR image matching algorithm based on multi-class features 4 hours ago | arxiv.org

abstract algorithm application arxiv +13

Enhancing Sign Language Teaching: A Mixed Reality Approach for Immersive Learning and Multi-Dimensional Feedback 4 hours ago | arxiv.org

abstract arxiv challenges classroom +13

A Linear Time and Space Local Point Cloud Geometry Encoder via Vectorized Kernel Mixture (VecKM) 4 hours ago | arxiv.org

abstract arxiv cloud compute +11

UP-CrackNet: Unsupervised Pixel-Wise Road Crack Detection via Adversarial Image Restoration 4 hours ago | arxiv.org

abstract adversarial algorithms arxiv +21

AttributionScanner: A Visual Analytics System for Model Validation with Metadata-Free Slice Finding 4 hours ago | arxiv.org

abstract analytics arxiv context +19

FurniScene: A Large-scale 3D Room Dataset with Intricate Furnishing Scenes 4 hours ago | arxiv.org

abstract applications arxiv attention +15

Founding AI Engineer, Agents

@ Occam AI | New York

View on ai-jobs.net

AI Engineer Intern, Agents

@ Occam AI | US

View on ai-jobs.net

AI Research Scientist

@ Vara | Berlin, Germany and Remote

View on ai-jobs.net

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Consultant - Artificial Intelligence & Data (Google Cloud Data Engineer) - MY / TH

@ Deloitte | Kuala Lumpur, MY

View on ai-jobs.net