Web: http://arxiv.org/abs/2205.04505

May 11, 2022, 1:10 a.m. | Courtney Mansfield, Amandalynne Paullada, Kristen Howell

cs.CL updates on arXiv.org

Many datasets contain personally identifiable information, or PII, which
poses privacy risks to individuals. PII masking is commonly used to redact
personal information such as names, addresses, and phone numbers from text
data. Most modern PII masking pipelines involve machine learning algorithms.
However, these systems may vary in performance, such that individuals from
particular demographic groups bear a higher risk of having their personal
information exposed. In this paper, we evaluate the performance of three
off-the-shelf PII masking systems on …
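
As a rough illustration of the kind of disparity the abstract describes, the sketch below computes a per-group miss rate: the fraction of PII mentions that a masking system failed to redact, broken down by demographic group. The function name `miss_rate_by_group`, the group labels, and the toy records are all hypothetical and are not taken from the paper; the paper's actual metrics and data may differ.

```python
from collections import defaultdict


def miss_rate_by_group(records):
    """Return the fraction of unmasked PII mentions for each group.

    `records` is an iterable of (group, was_masked) pairs, one per PII
    mention. This layout is an assumption made for illustration only.
    """
    totals = defaultdict(int)
    missed = defaultdict(int)
    for group, was_masked in records:
        totals[group] += 1
        if not was_masked:
            missed[group] += 1
    return {group: missed[group] / totals[group] for group in totals}


# Hypothetical toy data, not results from the paper.
records = [
    ("group_a", True), ("group_a", True), ("group_a", False), ("group_a", True),
    ("group_b", True), ("group_b", False), ("group_b", False), ("group_b", True),
]

rates = miss_rate_by_group(records)
for group, rate in sorted(rates.items()):
    print(f"{group}: {rate:.2f} of PII mentions left unmasked")
```

A gap between the groups' miss rates would indicate that one group's personal information is more likely to be left exposed by the system under test.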

arxiv bias detection pii

