Sept. 2, 2022, 1:15 a.m. | Iyadh Ben Cheikh Larbi, Aljoscha Burchardt, Roland Roller

cs.CL updates on arXiv.org arxiv.org

Clinical text processing has gained more and more attention in recent years.
The access to sensitive patient data, on the other hand, is still a big
challenge, as text cannot be shared without legal hurdles and without removing
personal information. There are many techniques to modify or remove patient
related information, each with different strengths. This paper investigates the
influence of different anonymization techniques on the performance of ML models
using multiple datasets corresponding to five different NLP tasks. Several …

anonymization arxiv nlp processing study text

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Machine Learning Research Scientist

@ d-Matrix | San Diego, Ca