Jan. 7, 2024, 12:37 p.m. | Shivamshinde

Towards AI - Medium pub.towardsai.net

From Raw to Refined: A Journey Through Data Preprocessing — Part 6: Imbalanced Datasets

This article will explain the concept of imbalanced datasets and the methods used to handle them.

Photo by Colton Sturgeon on Unsplash

Table of Contents

  1. What is imbalanced data?
  2. Degree of imbalance
  3. Why is having an imbalanced dataset a problem?
  4. Methods to deal with imbalanced data
    - Try getting more data
    - Try changing the performance metric
    - Try sampling of the data
    - Try different …
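
Two of the methods listed above, changing the performance metric and sampling the data, can be illustrated with a minimal sketch. It is not taken from the article: the toy labels, the always-majority "model", and the helper `random_oversample` are all hypothetical, and only the Python standard library is used. It shows why accuracy can mislead on imbalanced data and how naive random oversampling evens out the class counts.

```python
import random
from collections import Counter


def random_oversample(rows, labels, seed=0):
    """Duplicate randomly chosen minority rows until every class
    has as many samples as the largest class (hypothetical helper)."""
    rng = random.Random(seed)
    counts = Counter(labels)
    target = max(counts.values())
    out_rows, out_labels = list(rows), list(labels)
    for cls, n in counts.items():
        pool = [r for r, y in zip(rows, labels) if y == cls]
        extra = [rng.choice(pool) for _ in range(target - n)]
        out_rows.extend(extra)
        out_labels.extend([cls] * len(extra))
    return out_rows, out_labels


# Toy data (assumed): 95 negative samples and 5 positive samples.
labels = [0] * 95 + [1] * 5
rows = [[i] for i in range(100)]

# A "model" that always predicts the majority class.
predictions = [0] * len(labels)

# Accuracy looks good, but recall on the minority class is zero.
accuracy = sum(p == y for p, y in zip(predictions, labels)) / len(labels)
true_positives = sum(p == 1 and y == 1 for p, y in zip(predictions, labels))
recall = true_positives / sum(y == 1 for y in labels)

# Random oversampling brings both classes to the same count.
balanced_rows, balanced_labels = random_oversample(rows, labels)
print(accuracy, recall, Counter(balanced_labels))
```

In practice, resampling is usually applied to the training split only, so that duplicated rows do not leak into the evaluation data.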

imbalanced-data oversampling smote undersampling
