June 29, 2022, 1:12 a.m. | Marion Bartl, Susan Leavy

cs.CL updates on arXiv.org arxiv.org

This paper presents a new method for automatically detecting words with
lexical gender in large-scale language datasets. Currently, the evaluation of
gender bias in natural language processing relies on manually compiled lexicons
of gendered expressions, such as pronouns ('he', 'she', etc.) and nouns with
lexical gender ('mother', 'boyfriend', 'policewoman', etc.). However, manual
compilation of such lists can lead to static information if they are not
periodically updated and often involve value judgments by individual annotators
and researchers. Moreover, terms not …

arxiv databases gender inference methodology scalable

Data Engineer

@ Bosch Group | San Luis Potosí, Mexico

DATA Engineer (H/F)

@ Renault Group | FR REN RSAS - Le Plessis-Robinson (Siège)

Advisor, Data engineering

@ Desjardins | 1, Complexe Desjardins, Montréal

Data Engineer Intern

@ Getinge | Wayne, NJ, US

Software Engineer III- Java / Python / Pyspark / ETL

@ JPMorgan Chase & Co. | Jersey City, NJ, United States

Lead Data Engineer (Azure/AWS)

@ Telstra | Telstra ICC Bengaluru