Web: http://arxiv.org/abs/2204.04487

May 5, 2022, 1:11 a.m. | Jacob Eisenstein

cs.CL updates on arXiv.org arxiv.org

Spurious correlations are a threat to the trustworthiness of natural language
processing systems, motivating research into methods for identifying and
eliminating them. However, addressing the problem of spurious correlations
requires more clarity on what they are and how they arise in language data.
Gardner et al (2021) argue that the compositional nature of language implies
that \emph{all} correlations between labels and individual "input features" are
spurious. This paper analyzes this proposal in the context of a toy example,
demonstrating three …

arxiv language natural natural language on perspectives

Data Analyst, Patagonia Action Works

@ Patagonia | Remote

Data & Insights Strategy & Innovation General Manager

@ Chevron Services Company, a division of Chevron U.S.A Inc. | Houston, TX

Faculty members in Research areas such as Bayesian and Spatial Statistics; Data Privacy and Security; AI/ML; NLP; Image and Video Data Analysis

@ Ahmedabad University | Ahmedabad, India

Director, Applied Mathematics & Computational Research Division

@ Lawrence Berkeley National Lab | Berkeley, Ca

Business Data Analyst

@ MainStreet Family Care | Birmingham, AL

Assistant/Associate Professor of the Practice in Business Analytics

@ Georgetown University McDonough School of Business | Washington DC