March 18, 2024, 6:27 p.m. | /u/LebrawnJames416

Data Science www.reddit.com

I am currently working on a model that will predict if someone will claim in the next year, there is a class imbalance 80:20 and some casses 98:2. I can get a relatively high roc-auc(0.8 - 0.85) but that is not really appropriate as the confusion matrix shows a large number of false positives. I am now using auc-pr, and getting very low results 0.4 and below.

My question arises from seeing imbalanced classification tasks - from kaggle and research …

auc claim class classifier datascience false false positives matrix next roc shows will

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne