Feb. 20, 2024, 5:42 a.m. | Daniel Kowatsch, Nicolas M. M\"uller, Kilian Tscharke, Philip Sperl, Konstantin B\"otinger

cs.LG updates on arXiv.org arxiv.org

arXiv:2402.11963v1 Announce Type: new
Abstract: For classification, the problem of class imbalance is well known and has been extensively studied. In this paper, we argue that imbalance in regression is an equally important problem which has so far been overlooked: Due to under- and over-representations in a data set's target distribution, regressors are prone to degenerate to naive models, systematically neglecting uncommon training data and over-representing targets seen often during training. We analyse this problem theoretically and use resulting insights …

abstract arxiv class classification cs.ai cs.lg data data set datasets distribution paper regression set type

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US