Preserving correlations: A statistical method for generating synthetic data
March 5, 2024, 2:42 p.m. | Nicklas Jävergård, Rainey Lyons, Adrian Muntean, Jonas Forsman
cs.LG updates on arXiv.org arxiv.org
Abstract: We propose a method to generate statistically representative synthetic data. The main goal is to maintain in the synthetic dataset the correlations among the features present in the original one, while offering a comfortable level of privacy that can eventually be tailored to specific customer demands.
We describe in detail our algorithm used both for the analysis of the original dataset and for the generation of the synthetic data points. The approach is …