March 13, 2024, 4:47 a.m. | Oana Ignat, Longju Bai, Joan Nwatu, Rada Mihalcea

cs.CL updates on arXiv.org arxiv.org

arXiv:2403.07687v1 Announce Type: cross
Abstract: Current foundation models have shown impressive performance across various tasks. However, several studies have revealed that these models are not effective for everyone due to the imbalanced geographical and economic representation of the data used in the training process. Most of this data comes from Western countries, leading to poor results for underrepresented countries. To address this issue, more data needs to be collected from these countries, but the cost of annotation can be a …

annotation annotations arxiv balance budget cost cs.ai cs.cl cs.cv data geo performance type

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Sr. Software Development Manager, AWS Neuron Machine Learning Distributed Training

@ Amazon.com | Cupertino, California, USA