all AI news
D3CODE: Disentangling Disagreements in Data across Cultures on Offensiveness Detection and Evaluation
April 18, 2024, 4:46 a.m. | Aida Mostafazadeh Davani, Mark D\'iaz, Dylan Baker, Vinodkumar Prabhakaran
cs.CL updates on arXiv.org arxiv.org
Abstract: While human annotations play a crucial role in language technologies, annotator subjectivity has long been overlooked in data collection. Recent studies that have critically examined this issue are often situated in the Western context, and solely document differences across age, gender, or racial groups. As a result, NLP research on subjectivity have overlooked the fact that individuals within demographic groups may hold diverse values, which can influence their perceptions beyond their group norms. To effectively …
abstract age annotations arxiv collection context cs.cl data data collection detection differences document evaluation gender human issue language racial role studies technologies type
More from arxiv.org / cs.CL updates on arXiv.org
Jobs in AI, ML, Big Data
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
Software Engineer, Data Tools - Full Stack
@ DoorDash | Pune, India
Senior Data Analyst
@ Artsy | New York City