CMNER: A Chinese Multimodal NER Dataset based on Social Media
Feb. 22, 2024, 5:48 a.m. | Yuanze Ji, Bobo Li, Jun Zhou, Fei Li, Chong Teng, Donghong Ji
cs.CL updates on arXiv.org arxiv.org
Abstract: Multimodal Named Entity Recognition (MNER) is a pivotal task designed to extract named entities from text with the support of pertinent images. Nonetheless, a notable paucity of data for Chinese MNER has considerably impeded the progress of this natural language processing task within the Chinese domain. Consequently, in this study, we compile a Chinese Multimodal NER dataset (CMNER) utilizing data sourced from Weibo, China's largest social media platform. Our dataset encompasses 5,000 Weibo posts paired …