Web: http://arxiv.org/abs/2205.03302

May 9, 2022, 1:11 a.m. | Esma Balkir, Isar Nejadgholi, Kathleen C. Fraser, Svetlana Kiritchenko

cs.CL updates on arXiv.org

We present a novel feature attribution method for explaining text
classifiers, and analyze it in the context of hate speech detection. Although
feature attribution models usually provide a single importance score for each
token, we instead provide two complementary and theoretically grounded scores
-- necessity and sufficiency -- resulting in more informative explanations. We
propose a transparent method that calculates these values by generating
explicit perturbations of the input text, allowing the importance scores
themselves to be explainable. We employ our …
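
To make the idea of perturbation-based necessity and sufficiency scores concrete, here is a minimal Python sketch. It is not the authors' method: the abstract does not say how perturbations are generated or how the scores are defined, so the keyword-based `toy_classifier`, the `[MASK]` replacement token, the exhaustive enumeration of masked subsets, and the two scoring functions are all assumptions made for illustration.

```python
from itertools import combinations

MASK = "[MASK]"
TRIGGERS = {"stupid", "idiots"}  # hypothetical keyword list for the toy model


def toy_classifier(tokens):
    """Hypothetical stand-in for a real classifier: flag any trigger word."""
    return any(tok.lower() in TRIGGERS for tok in tokens)


def mask(tokens, positions):
    """Return a copy of tokens with the given positions replaced by MASK."""
    return [MASK if j in positions else tok for j, tok in enumerate(tokens)]


def contexts(tokens, i):
    """Yield every subset of positions other than i (the explicit perturbations)."""
    others = [j for j in range(len(tokens)) if j != i]
    for size in range(len(others) + 1):
        for combo in combinations(others, size):
            yield set(combo)


def necessity(tokens, i, classify):
    """Among perturbations flagged with token i present, the share that
    stop being flagged once token i is masked as well (assumed definition)."""
    flagged = flipped = 0
    for ctx in contexts(tokens, i):
        if classify(mask(tokens, ctx)):
            flagged += 1
            flipped += not classify(mask(tokens, ctx | {i}))
    return flipped / flagged if flagged else 0.0


def sufficiency(tokens, i, classify):
    """Among perturbations not flagged with token i masked, the share that
    become flagged once token i is restored (assumed definition)."""
    unflagged = flipped = 0
    for ctx in contexts(tokens, i):
        if not classify(mask(tokens, ctx | {i})):
            unflagged += 1
            flipped += classify(mask(tokens, ctx))
    return flipped / unflagged if unflagged else 0.0


tokens = ["you", "stupid", "idiots"]
# Map each token to its (necessity, sufficiency) pair under the toy model.
scores = {tok: (necessity(tokens, i, toy_classifier),
                sufficiency(tokens, i, toy_classifier))
          for i, tok in enumerate(tokens)}
```

Under this toy model each trigger word is fully sufficient but only partly necessary, because the other trigger word can still produce the flag on its own. A single importance score per token could not express that difference. Enumerating every subset is only feasible for very short inputs; a longer text would need sampled perturbations instead.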
