Web: http://arxiv.org/abs/2208.13080

Sept. 22, 2022, 1:13 a.m. | Ethan Pickering, Themistoklis P. Sapsis

stat.ML updates on arXiv.org

Not all data are equal. Misleading or unnecessary data can critically hinder
the accuracy of Machine Learning (ML) models. When data is plentiful,
misleading effects can be overcome, but in many real-world applications data is
sparse and expensive to acquire. We present a method that substantially reduces
the amount of data needed to train ML models accurately, potentially opening the
door to many new limited-data applications in ML. Our method extracts the
most informative data, while ignoring and omitting data that …


