[Discussion] Data preprocessing before or after splitting
May 19, 2022, 11:20 p.m. | /u/YeccAnon4
Machine Learning www.reddit.com
I have a dataset of financial values for companies. Some features have missing values, and the data is imbalanced.
Here's what I did:
Scaling the data with sklearn's StandardScaler
Imputing missing values with KNNImputer and IterativeImputer, trying several strategies and choosing the best one
Oversampling with SMOTE
Testing with a random forest classifier
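For reference, here is a minimal sketch of one way to order these steps when the split comes first, so the scaler and imputer are fitted on the training rows only. It is an illustration, not the pipeline described in the post: the data is synthetic, and the names `model` and `test_accuracy` and all parameter values are assumptions. SMOTE is left out so the sketch needs only scikit-learn; with imbalanced-learn, a SMOTE step would typically sit between the imputer and the classifier in that library's own Pipeline class, which applies resampling during fitting only.

```python
import numpy as np
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.impute import KNNImputer
from sklearn.model_selection import train_test_split
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic, imbalanced stand-in for the financial dataset (hypothetical data).
X, y = make_classification(
    n_samples=400, n_features=8, weights=[0.85, 0.15], random_state=0
)
rng = np.random.default_rng(0)
X[rng.random(X.shape) < 0.1] = np.nan  # blank out roughly 10% of the values

# Split first, so nothing learned from the test rows leaks into preprocessing.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.25, stratify=y, random_state=0
)

# Scaling, imputation and the classifier are fitted together on the training set.
model = Pipeline([
    ("scale", StandardScaler()),            # skips NaNs when computing mean/std
    ("impute", KNNImputer(n_neighbors=5)),  # fills NaNs from nearest training rows
    ("clf", RandomForestClassifier(n_estimators=100, random_state=0)),
])
model.fit(X_train, y_train)

# The test set is only transformed with statistics learned from the training set.
test_accuracy = model.score(X_test, y_test)
print(f"test accuracy: {test_accuracy:.3f}")
```

On imbalanced data, accuracy alone can be misleading; a metric such as balanced accuracy or F1 on the untouched test set may be more informative.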
The next step is to add more features:
1. By calculating some features from my already …