all AI news
Preserving correlations: A statistical method for generating synthetic data
March 5, 2024, 2:42 p.m. | Nicklas J\"averg{\aa}rd, Rainey Lyons, Adrian Muntean, Jonas Forsman
cs.LG updates on arXiv.org arxiv.org
Abstract: We propose a method to generate statistically representative synthetic data. The main goal is to be able to maintain in the synthetic dataset the correlations of the features present in the original one, while offering a comfortable privacy level that can be eventually tailored on specific customer demands.
We describe in detail our algorithm used both for the analysis of the original dataset and for the generation of the synthetic data points. The approach is …
abstract arxiv correlations cs.lg data dataset eventually features generate math.pr physics.data-an privacy statistical statistical method synthetic synthetic data type
More from arxiv.org / cs.LG updates on arXiv.org
Jobs in AI, ML, Big Data
Software Engineer for AI Training Data (School Specific)
@ G2i Inc | Remote
Software Engineer for AI Training Data (Python)
@ G2i Inc | Remote
Software Engineer for AI Training Data (Tier 2)
@ G2i Inc | Remote
Data Engineer
@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania
Artificial Intelligence – Bioinformatic Expert
@ University of Texas Medical Branch | Galveston, TX
Lead Developer (AI)
@ Cere Network | San Francisco, US