all AI news
Statistical Multicriteria Benchmarking via the GSD-Front
June 7, 2024, 4:43 a.m. | Christoph Jansen (Lancaster University Leipzig), Georg Schollmeyer (Ludwig-Maximilians-Universit\"at M\"unchen), Julian Rodemann (Ludwig-Maximilians-U
cs.LG updates on arXiv.org arxiv.org
Abstract: Given the vast number of classifiers that have been (and continue to be) proposed, reliable methods for comparing them are becoming increasingly important. The desire for reliability is broken down into three main aspects: (1) Comparisons should allow for different quality metrics simultaneously. (2) Comparisons should take into account the statistical uncertainty induced by the choice of benchmark suite. (3) The robustness of the comparisons under small deviations in the underlying assumptions should be verifiable. …
abstract arxiv benchmarking classifiers cs.lg front metrics quality reliability statistical stat.me stat.ml them type vast via
More from arxiv.org / cs.LG updates on arXiv.org
Jobs in AI, ML, Big Data
Senior Data Engineer
@ Displate | Warsaw
Senior Robotics Engineer - Applications
@ Vention | Montréal, QC, Canada
Senior Application Security Engineer, SHINE - Security Hub for Innovation and Efficiency
@ Amazon.com | Toronto, Ontario, CAN
Simulation Scientist , WWDE Simulation
@ Amazon.com | Bellevue, Washington, USA
Giáo Viên Steam
@ Việc Làm Giáo Dục | Da Nang, Da Nang, Vietnam
Senior Simulation Developer
@ Vention | Montréal, QC, Canada