Oct. 7, 2022, 1:17 a.m. | Marta R. Costa-jussà, Eric Smith, Christophe Ropers, Daniel Licht, Javier Ferrando, Carlos Escolano

cs.CL updates on arXiv.org arxiv.org

Machine Translation systems can produce different types of errors, some of
which get characterized as critical or catastrophic due to the specific
negative impact they can have on users. Automatic or human evaluation metrics
do not necessarily differentiate between such critical errors and more
innocuous ones. In this paper we focus on one type of critical error: added
toxicity. We evaluate and analyze added toxicity when translating a large
evaluation dataset (HOLISTICBIAS, over 472k sentences, covering 13 demographic
axes) from …

arxiv machine machine translation scale toxicity translation

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Data Scientist

@ Publicis Groupe | New York City, United States

Bigdata Cloud Developer - Spark - Assistant Manager

@ State Street | Hyderabad, India