March 5, 2024, 2:42 p.m. | Nicklas J\"averg{\aa}rd, Rainey Lyons, Adrian Muntean, Jonas Forsman

cs.LG updates on arXiv.org arxiv.org

arXiv:2403.01471v1 Announce Type: new
Abstract: We propose a method to generate statistically representative synthetic data. The main goal is to be able to maintain in the synthetic dataset the correlations of the features present in the original one, while offering a comfortable privacy level that can be eventually tailored on specific customer demands.
We describe in detail our algorithm used both for the analysis of the original dataset and for the generation of the synthetic data points. The approach is …

abstract arxiv correlations cs.lg data dataset eventually features generate math.pr physics.data-an privacy statistical statistical method synthetic synthetic data type

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote