April 18, 2022, 3:37 p.m. | /u/JillOfNoTrades

Data Science www.reddit.com

I recently ran a model with 25 fold cross validation. I obtained the cross validated predictions, and low and behold, the ROC AUC is only 0.56. Well, shucks. That's not that good. However, when I break these predictions up into 5 quantiles, there is a very blatant trend in the real target, which looks almost too good to be true:

​

​

|Out of Fold Quantile|Average Target|
|:-|:-|
|0|0.096|
|1|0.101|
|2|0.118|
|3|0.133|
|4|0.163|

​

Graphically, this looks like:

​

https://preview.redd.it/mvqy31fz3bu81.png?width=480&format=png&auto=webp&s=25d5a89d03737d629669113028a33a843827bbb5 …

auc datascience predictions

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead Data Engineer

@ WorkMoney | New York City, United States - Remote