Feb. 23, 2024, 5:46 a.m. | Yiqiao Jin, Minje Choi, Gaurav Verma, Jindong Wang, Srijan Kumar

cs.CV updates on arXiv.org arxiv.org

arXiv:2402.14154v1 Announce Type: cross
Abstract: Social media platforms are hubs for multimodal information exchange, encompassing text, images, and videos, making it challenging for machines to comprehend the information or emotions associated with interactions in online spaces. Multimodal Large Language Models (MLLMs) have emerged as a promising solution to address these challenges, yet struggle with accurately interpreting human emotions and complex contents like misinformation. This paper introduces MM-Soc, a comprehensive benchmark designed to evaluate MLLMs' understanding of multimodal social media content. …

abstract arxiv benchmarking cs.cl cs.cv cs.cy emotions images information interactions language language models large language large language models machines making media mllms multimodal platforms soc social social media social media platforms solution spaces text the information type videos

Senior Machine Learning Engineer

@ GPTZero | Toronto, Canada

Sr. Data Operations

@ Carousell Group | West Jakarta, Indonesia

Senior Analyst, Business Intelligence & Reporting

@ Deutsche Bank | Bucharest

Business Intelligence Subject Matter Expert (SME) - Assistant Vice President

@ Deutsche Bank | Cary, 3000 CentreGreen Way

Enterprise Business Intelligence Specialist

@ NAIC | Kansas City

Senior Business Intelligence (BI) Developer - Associate

@ Deutsche Bank | Cary, 3000 CentreGreen Way