April 14, 2022, 7:14 a.m. | John Willcox

Towards Data Science - Medium towardsdatascience.com

Investigating whether using categorical embeddings from a fast.ai neural network can improve the performance of CatBoost in a classification task

Photo by Tyler Easton on Unsplash

CatBoost is an open-sourced machine learning algorithm from Yandex. Its name is derived from the words Category Boosting. As you might expect, one of the library’s main advantages is handling categorical data in a more intuitive way than other decision tree algorithms. It also performs favourably compared to other decision tree algorithms. …

catboost embedding fastai machine learning regression

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Research Scientist

@ Meta | Menlo Park, CA

Principal Data Scientist

@ Mastercard | O'Fallon, Missouri (Main Campus)