Nov. 20, 2023, 4:32 p.m. | /u/Careful_Engineer_700

Data Science www.reddit.com

I am preparing a dataset for a classification task at work. As you can see, 13 of my features show multicollinearity, and I could not draw any clear conclusions from the correlation matrix about what to do.

What do you think I should do here? I have 60 features in total. I cleaned the data, checked for duplicates and outliers, and standardized everything, so now it's a matter of feature selection, I think?
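One common way to put numbers on this kind of multicollinearity is the variance inflation factor (VIF): regress each feature on all the others and compute 1 / (1 - R^2). Below is a minimal sketch using only NumPy. The function names `vif_scores` and `drop_high_vif`, the threshold of 10 (a frequently quoted rule of thumb, not a hard rule), and the synthetic demo data are all assumptions for illustration and are not taken from the actual dataset.

```python
import numpy as np

def vif_scores(X):
    """Return the variance inflation factor of each column of X."""
    X = np.asarray(X, dtype=float)
    n_samples, n_features = X.shape
    scores = np.empty(n_features)
    for j in range(n_features):
        y = X[:, j]
        others = np.delete(X, j, axis=1)
        # Intercept column so that R^2 is measured around the mean of y.
        A = np.column_stack([np.ones(n_samples), others])
        coef, *_ = np.linalg.lstsq(A, y, rcond=None)
        resid = y - A @ coef
        ss_res = resid @ resid
        ss_tot = ((y - y.mean()) ** 2).sum()  # assumes no constant columns
        r2 = 1.0 - ss_res / ss_tot
        scores[j] = np.inf if r2 >= 1.0 else 1.0 / (1.0 - r2)
    return scores

def drop_high_vif(X, names, threshold=10.0):
    """Repeatedly drop the feature with the largest VIF above threshold."""
    X = np.asarray(X, dtype=float)
    names = list(names)
    while X.shape[1] > 1:
        scores = vif_scores(X)
        worst = int(np.argmax(scores))
        if scores[worst] <= threshold:
            break
        X = np.delete(X, worst, axis=1)
        names.pop(worst)
    return X, names

# Synthetic demo data (hypothetical): "c" is almost a copy of "a".
rng = np.random.default_rng(0)
a = rng.normal(size=500)
b = rng.normal(size=500)
c = a + 0.01 * rng.normal(size=500)
X_demo = np.column_stack([a, b, c])
_, kept = drop_high_vif(X_demo, ["a", "b", "c"])
print(kept)
```

Dropping one feature at a time and recomputing matters here, because removing a single member of a correlated group often brings the VIFs of the rest back down. Whether pruning by VIF is the right move at all depends on the model: it matters most for interpreting linear-model coefficients and much less for tree-based classifiers.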

Could really use some …

