Web: http://arxiv.org/abs/2201.10547

Jan. 26, 2022, 2:11 a.m. | Mariel A. Werner, Anastasios Angelopoulos, Stephen Bates, Michael I. Jordan

cs.LG updates on arXiv.org arxiv.org

The blessing of ubiquitous data also comes with a curse: the communication,
storage, and labeling of massive, mostly redundant datasets. In our work, we
seek to solve the problem at its source, collecting only valuable data and
throwing out the rest, via active learning. We propose an online algorithm
which, given any stream of data, any assessment of its value, and any
formulation of its selection cost, extracts the most valuable subset of the
stream up to a constant factor …

active learning arxiv learning online

More from arxiv.org / cs.LG updates on arXiv.org

Senior Data Engineer

@ DAZN | Hammersmith, London, United Kingdom

Sr. Data Engineer, Growth

@ Netflix | Remote, United States

Data Engineer - Remote

@ Craft | Wrocław, Lower Silesian Voivodeship, Poland

Manager, Operations Data Science

@ Binance.US | Vancouver

Senior Machine Learning Researcher for Copilot

@ GitHub | Remote - Europe

Sr. Marketing Data Analyst

@ HoneyBook | San Francisco, CA