Data Banzhaf: A Data Valuation Framework with Maximal Robustness to Learning Stochasticity. (arXiv:2205.15466v2 [cs.LG] UPDATED) | allainews.com

June 30, 2022, 1:11 a.m. | Tianhao Wang, Ruoxi Jia

stat.ML updates on arXiv.org arxiv.org

This paper studies the robustness of data valuation to noisy model
performance scores. Particularly, we find that the inherent randomness of the
widely used stochastic gradient descent can cause existing data value notions
(e.g., the Shapley value and the Leave-one-out error) to produce inconsistent
data value rankings across different runs. To address this challenge, we first
pose a formal framework within which one can measure the robustness of a data
value notion. We show that the Banzhaf value, a value …

arxiv data framework learning lg robustness valuation

More from arxiv.org / stat.ML updates on arXiv.org

Fused Extended Two-Way Fixed Effects for Difference-in-Differences with Staggered Adoptions 10 hours ago | arxiv.org

abstract arxiv bias canonical +16

Dropout Regularization Versus $\ell_2$-Penalization in the Linear Model 10 hours ago | arxiv.org

abstract arxiv behavior convergence +15

Partial recovery and weak consistency in the non-uniform hypergraph Stochastic Block Model 10 hours ago | arxiv.org

abstract algorithm arxiv block +15

Estimating the Number of Components in Finite Mixture Models via Variational Approximation 10 hours ago | arxiv.org

abstract approximation arxiv bayes +11

Conformalized Ordinal Classification with Marginal and Conditional Coverage 10 hours ago | arxiv.org

abstract algorithm applications arxiv +16

Multi-Study R-Learner for Estimating Heterogeneous Treatment Effects Across Studies Using Statistical Machine Learning 19 hours ago | arxiv.org

abstract arxiv effects machine +15

Spatial best linear unbiased prediction: A computational mathematics approach for high dimensional massive datasets 19 hours ago | arxiv.org

abstract arxiv challenges classification +20

Estimation Sample Complexity of a Class of Nonlinear Continuous-time Systems 2 days, 10 hours ago | arxiv.org

abstract arxiv class complexity +14

Estimation and Uniform Inference in Sparse High-Dimensional Additive Models 2 days, 10 hours ago | arxiv.org

abstract arxiv confidence construct +9

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Senior AI & Data Engineer

@ Bertelsmann | Kuala Lumpur, 14, MY, 50400

View on ai-jobs.net

Analytics Engineer

@ Reverse Tech | Philippines - Remote

View on ai-jobs.net