May 20, 2022, 1:12 a.m. | David Harbecke, Yuxuan Chen, Leonhard Hennig, Christoph Alt

cs.LG updates on arXiv.org arxiv.org

Relation classification models are conventionally evaluated using only a
single measure, e.g., micro-F1, macro-F1 or AUC. In this work, we analyze
weighting schemes, such as micro and macro, for imbalanced datasets. We
introduce a framework for weighting schemes, where existing schemes are
extremes, and two new intermediate schemes. We show that reporting results of
different weighting schemes better highlights strengths and weaknesses of a
model.

arxiv classification f1

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead Data Engineer

@ WorkMoney | New York City, United States - Remote