Feb. 9, 2022, 11:30 a.m. | C. Golo Naito

Towards Data Science - Medium towardsdatascience.com

How to deal with imbalanced data

Image by ogamiichiro3 from Pixabay

Introduction

This article describes how to use fastai for multiclass classification, specifically 3,017 classes of Japanese kanji characters. Kanji characters are a major part of the Japanese writing system alongside hiragana and katakana. There are many thousands of kanji characters, which makes it a challenging way to explore multiclass classification models. All the experiments have been executed in Google Colab. I included various code snippets in this article, the …

fastai imbalanced-data japanese transfer learning

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Intern Large Language Models Planning (f/m/x)

@ BMW Group | Munich, DE

Data Engineer Analytics

@ Meta | Menlo Park, CA | Remote, US