Sept. 16, 2022, 1:12 a.m. | Giuseppe Vietri, Cedric Archambeau, Sergul Aydore, William Brown, Michael Kearns, Aaron Roth, Ankit Siva, Shuai Tang, Zhiwei Steven Wu

cs.LG updates on arXiv.org arxiv.org

We provide a differentially private algorithm for producing synthetic data
simultaneously useful for multiple tasks: marginal queries and multitask
machine learning (ML). A key innovation in our algorithm is the ability to
directly handle numerical features, in contrast to a number of related prior
approaches which require numerical features to be first converted into {high
cardinality} categorical features via {a binning strategy}. Higher binning
granularity is required for better accuracy, but this negatively impacts
scalability. Eliminating the need for binning …

arxiv data multitask learning synthetic data

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Machine Learning Research Scientist

@ d-Matrix | San Diego, Ca