April 15, 2022, 1:11 a.m. | Anastasiia Sedova, Benjamin Roth

cs.LG updates on arXiv.org arxiv.org

A way to overcome expensive and time-consuming manual data labeling is weak
supervision - automatic annotation of data samples via a predefined set of
labeling functions (LFs), rule-based mechanisms that generate potentially
erroneous labels. In this work, we investigate noise reduction techniques for
weak supervision based on the principle of k-fold cross-validation. In
particular, we extend two frameworks for detecting the erroneous samples in
manually annotated data to the weakly supervised setting. Our methods profit
from leveraging the information about …

arxiv function labeling unsupervised validation

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Data Management Associate

@ EcoVadis | Ebène, Mauritius

Senior Data Engineer

@ Telstra | Telstra ICC Bengaluru