Feb. 22, 2024, 5:42 a.m. | Robin Staab, Mark Vero, Mislav Balunovi\'c, Martin Vechev

cs.LG updates on arXiv.org arxiv.org

arXiv:2402.13846v1 Announce Type: cross
Abstract: Recent work in privacy research on large language models has shown that they achieve near human-level performance at inferring personal data from real-world online texts. With consistently increasing model capabilities, existing text anonymization methods are currently lacking behind regulatory requirements and adversarial threats. This raises the question of how individuals can effectively protect their personal data in sharing online texts. In this work, we take two steps to answer this question: We first present a …

abstract advanced adversarial anonymization arxiv capabilities cs.ai cs.cl cs.cr cs.lg data human language language models large language large language models near performance personal data privacy question raises regulatory requirements research text threats type work world

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Machine Learning Research Scientist

@ d-Matrix | San Diego, Ca