June 27, 2022, 1:11 a.m. | Xuandong Zhao, Lei Li, Yu-Xiang Wang

cs.CL updates on arXiv.org

Large language models have been shown to memorize private information, such as
social security numbers, present in their training data. Given the sheer scale of
the training corpus, it is challenging to screen and filter this private data,
either manually or automatically. In this paper, we propose Confidentially
Redacted Training (CRT), a method for training language generation models while
protecting confidential segments. We borrow ideas from differential privacy
(which solves a related but distinct problem) and show that our method is able …
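Since the abstract is truncated before the mechanism is described, here is a minimal, hypothetical sketch of one adjacent idea: masking confidential token spans out of the language-modelling loss so they contribute no gradient. This is an illustration only, not the paper's CRT algorithm; the function and variable names (`masked_lm_loss`, `confidential_mask`) are made up for the example.

```python
# Hypothetical sketch: exclude confidential spans from the LM training loss.
# NOT the paper's CRT method; just a simple loss-masking illustration in PyTorch.
import torch
import torch.nn.functional as F

def masked_lm_loss(logits, targets, confidential_mask):
    """Cross-entropy loss that ignores confidential positions.

    logits:            (batch, seq_len, vocab) model outputs
    targets:           (batch, seq_len) gold token ids
    confidential_mask: (batch, seq_len) bool, True where a token is confidential
    (Next-token shifting is omitted here for brevity.)
    """
    # -100 is PyTorch's conventional ignore_index sentinel for cross_entropy.
    targets = targets.masked_fill(confidential_mask, -100)
    return F.cross_entropy(
        logits.reshape(-1, logits.size(-1)),
        targets.reshape(-1),
        ignore_index=-100,
    )

# Toy usage with random tensors standing in for a real model and tokenizer.
batch, seq_len, vocab = 2, 8, 100
logits = torch.randn(batch, seq_len, vocab)
targets = torch.randint(0, vocab, (batch, seq_len))
mask = torch.zeros(batch, seq_len, dtype=torch.bool)
mask[0, 3:6] = True  # pretend positions 3..5 of the first example hold an SSN
print(masked_lm_loss(logits, targets, mask))
```

Note that plain loss masking alone gives no formal guarantee; the paper's contribution, per the abstract, is a provable protection built on differential-privacy ideas.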

arxiv language modelling
