all AI news
MetaIE: Distilling a Meta Model from LLM for All Kinds of Information Extraction Tasks
April 2, 2024, 7:51 p.m. | Letian Peng, Zilong Wang, Feng Yao, Zihan Wang, Jingbo Shang
cs.CL updates on arXiv.org arxiv.org
Abstract: Information extraction (IE) is a fundamental area in natural language processing where prompting large language models (LLMs), even with in-context examples, cannot defeat small LMs tuned on very small IE datasets. We observe that IE tasks, such as named entity recognition and relation extraction, all focus on extracting important information, which can be formalized as a label-to-span matching. In this paper, we propose a novel framework MetaIE to build a small LM as meta-model by …
abstract arxiv context cs.cl datasets examples extraction information information extraction language language models language processing large language large language models llm llms lms meta natural natural language natural language processing observe processing prompting recognition small tasks type
More from arxiv.org / cs.CL updates on arXiv.org
Jobs in AI, ML, Big Data
Founding AI Engineer, Agents
@ Occam AI | New York
AI Engineer Intern, Agents
@ Occam AI | US
AI Research Scientist
@ Vara | Berlin, Germany and Remote
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne