all AI news
Balanced Mixed-Type Tabular Data Synthesis with Diffusion Models
April 15, 2024, 4:42 a.m. | Zeyu Yang, Peikun Guo, Khadija Zanna, Akane Sano
cs.LG updates on arXiv.org arxiv.org
Abstract: Diffusion models have emerged as a robust framework for various generative tasks, such as image and audio synthesis, and have also demonstrated a remarkable ability to generate mixed-type tabular data comprising both continuous and discrete variables. However, current approaches to training diffusion models on mixed-type tabular data tend to inherit the imbalanced distributions of features present in the training dataset, which can result in biased sampling. In this research, we introduce a fair diffusion model …
abstract arxiv audio audio synthesis continuous cs.lg current data diffusion diffusion models framework generate generative however image mixed robust synthesis tabular tabular data tasks training type variables
More from arxiv.org / cs.LG updates on arXiv.org
Jobs in AI, ML, Big Data
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
Business Intelligence Manager
@ Sanofi | Budapest
Principal Engineer, Data (Hybrid)
@ Homebase | Toronto, Ontario, Canada