all AI news
Using Large Language Models to Enrich the Documentation of Datasets for Machine Learning
April 25, 2024, 5:44 p.m. | Joan Giner-Miguelez, Abel G\'omez, Jordi Cabot
cs.CL updates on arXiv.org arxiv.org
Abstract: Recent regulatory initiatives like the European AI Act and relevant voices in the Machine Learning (ML) community stress the need to describe datasets along several key dimensions for trustworthy AI, such as the provenance processes and social concerns. However, this information is typically presented as unstructured text in accompanying documentation, hampering their automated analysis and processing. In this work, we explore using large language models (LLM) and a set of prompting strategies to automatically extract …
abstract act ai act arxiv community concerns cs.ai cs.cl cs.dl datasets dimensions documentation however information key language language models large language large language models machine machine learning processes provenance regulatory social stress trustworthy trustworthy ai type voices
More from arxiv.org / cs.CL updates on arXiv.org
Jobs in AI, ML, Big Data
Artificial Intelligence – Bioinformatic Expert
@ University of Texas Medical Branch | Galveston, TX
Lead Developer (AI)
@ Cere Network | San Francisco, US
Research Engineer
@ Allora Labs | Remote
Ecosystem Manager
@ Allora Labs | Remote
Founding AI Engineer, Agents
@ Occam AI | New York
AI Engineer Intern, Agents
@ Occam AI | US