ArXiv Pre-Print “Evaluating AI Systems under Uncertain Ground Truth: a Case Study in Dermatology”

Nov. 1, 2023, 1:06 p.m. | David Stutz

Blog Archives • David Stutz davidstutz.de

In supervised machine learning, we usually assume access to ground truth labels for evaluation. In many applications, however, these labels are derived from expert opinions. Disagreement among the experts is typically resolved, and thereby ignored, by simple majority voting or averaging. Unfortunately, this can have severe consequences: it can over-estimate performance or misguide model selection. In the work presented in this article, we tackle this problem by introducing a statistical framework for aggregating expert opinions.
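
To make the concern concrete, here is a minimal Python sketch. It is not the statistical framework from the pre-print, only a toy comparison under stated assumptions: the diagnoses, votes, and predictions are made up, and the helper names (`majority_label`, `vote_distribution`, `majority_accuracy`, `expected_accuracy`) are hypothetical. It scores the same predictions once against majority-vote labels and once against the full distribution of expert votes.

```python
from collections import Counter


def majority_label(votes):
    """Return the most frequent label; ties go to the label seen first."""
    counts = Counter(votes)
    return max(counts, key=counts.get)


def vote_distribution(votes):
    """Return the fraction of experts that chose each label."""
    counts = Counter(votes)
    total = len(votes)
    return {label: n / total for label, n in counts.items()}


def majority_accuracy(predictions, all_votes):
    """Accuracy when each case is collapsed to its majority-vote label."""
    hits = sum(
        pred == majority_label(votes)
        for pred, votes in zip(predictions, all_votes)
    )
    return hits / len(predictions)


def expected_accuracy(predictions, all_votes):
    """Average fraction of experts who agree with the prediction."""
    agreement = sum(
        vote_distribution(votes).get(pred, 0.0)
        for pred, votes in zip(predictions, all_votes)
    )
    return agreement / len(predictions)


# Made-up expert votes for four cases (illustrative only).
votes = [
    ["eczema", "eczema", "psoriasis"],
    ["melanoma", "nevus", "melanoma"],
    ["eczema", "eczema", "eczema"],
    ["nevus", "melanoma", "nevus", "nevus"],
]
# Made-up model predictions for the same four cases.
predictions = ["eczema", "melanoma", "eczema", "melanoma"]

print("majority-vote accuracy:", majority_accuracy(predictions, votes))
print("vote-weighted accuracy:", round(expected_accuracy(predictions, votes), 4))
```

With these made-up votes, majority voting credits the model with 3 of 4 cases (0.75), while the vote-weighted score is about 0.65, because two of the "correct" cases had a dissenting expert. The gap can go in either direction on other data; the point is only that collapsing votes to a single label hides this uncertainty.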
