April 25, 2024, 5:44 p.m. | Joan Giner-Miguelez, Abel G\'omez, Jordi Cabot

cs.CL updates on arXiv.org arxiv.org

arXiv:2404.15320v1 Announce Type: cross
Abstract: Recent regulatory initiatives like the European AI Act and relevant voices in the Machine Learning (ML) community stress the need to describe datasets along several key dimensions for trustworthy AI, such as the provenance processes and social concerns. However, this information is typically presented as unstructured text in accompanying documentation, hampering their automated analysis and processing. In this work, we explore using large language models (LLM) and a set of prompting strategies to automatically extract …

abstract act ai act arxiv community concerns cs.ai cs.cl cs.dl datasets dimensions documentation however information key language language models large language large language models machine machine learning processes provenance regulatory social stress trustworthy trustworthy ai type voices

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne