May 19, 2022, 11:20 p.m. | /u/YeccAnon4

Machine Learning www.reddit.com

This might be a very newbie question.

I have a dataset of financial values for companies. Some features have missing values, and the classes are imbalanced.

Here's what I did:

Scaled the data with scikit-learn's StandardScaler
Imputed missing values with KNNImputer and IterativeImputer, trying several strategies and choosing the best one
Oversampled with SMOTE

Then I tested with a random forest classifier.
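
For reference, the steps above can be sketched roughly as follows. This is only a minimal sketch under several assumptions that are not in the post: the data is synthetic (`make_classification` with values knocked out at random), only the KNN imputer is shown, and the split happens before anything is fitted so that scaling, imputation, and oversampling only ever see the training rows. To keep it self-contained, `smote_like_oversample` is a hypothetical, simplified stand-in for SMOTE (plain interpolation between minority-class neighbours); in practice the `SMOTE` class from imbalanced-learn would normally be used instead.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.impute import KNNImputer
from sklearn.metrics import f1_score
from sklearn.model_selection import train_test_split
from sklearn.neighbors import NearestNeighbors
from sklearn.preprocessing import StandardScaler


def smote_like_oversample(X, y, minority_label, k=5, seed=0):
    """Simplified SMOTE-style stand-in (hypothetical helper, not imbalanced-learn).

    Creates synthetic minority rows by interpolating between a minority row
    and one of its k nearest minority neighbours until classes are balanced.
    """
    rng = np.random.default_rng(seed)
    X_min = X[y == minority_label]
    n_new = int((y != minority_label).sum() - len(X_min))
    if n_new <= 0:
        return X, y
    # Ask for k + 1 neighbours because each row is its own nearest neighbour.
    nn = NearestNeighbors(n_neighbors=k + 1).fit(X_min)
    neighbours = nn.kneighbors(X_min, return_distance=False)[:, 1:]
    base = rng.integers(0, len(X_min), size=n_new)
    picked = neighbours[base, rng.integers(0, k, size=n_new)]
    gap = rng.random((n_new, 1))
    X_new = X_min[base] + gap * (X_min[picked] - X_min[base])
    y_new = np.full(n_new, minority_label)
    return np.vstack([X, X_new]), np.concatenate([y, y_new])


# Synthetic stand-in for the financial dataset: imbalanced, ~10% missing values.
X, y = make_classification(
    n_samples=1000, n_features=8, weights=[0.9, 0.1], random_state=0
)
rng = np.random.default_rng(0)
X[rng.random(X.shape) < 0.1] = np.nan

# Split first, so the test rows stay untouched by fitting and oversampling.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, stratify=y, random_state=0
)

# StandardScaler ignores NaNs when fitting and leaves them in place on transform.
scaler = StandardScaler().fit(X_train)
imputer = KNNImputer(n_neighbors=5).fit(scaler.transform(X_train))

X_train_p = imputer.transform(scaler.transform(X_train))
X_test_p = imputer.transform(scaler.transform(X_test))

# Oversample the training split only.
X_res, y_res = smote_like_oversample(X_train_p, y_train, minority_label=1)

clf = RandomForestClassifier(n_estimators=200, random_state=0).fit(X_res, y_res)
print("F1 on the untouched test split:", f1_score(y_test, clf.predict(X_test_p)))
```

Scaling before imputing matters for the KNN imputer because it is distance-based, so unscaled features with large ranges would dominate the neighbour search. Swapping `KNNImputer` for `IterativeImputer` additionally requires `from sklearn.experimental import enable_iterative_imputer` before the import from `sklearn.impute`.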


The next step is to add more features:
1. By calculating some features from my already …

