Jan. 7, 2022, 5:56 a.m. | Kevin Kho

Towards Data Science - Medium | towardsdatascience.com

Run PyCaret functions on each partition of your data in a distributed way


PyCaret is a low-code machine learning framework that automates many parts of the machine learning pipeline. With just a few lines of code, several models can be trained on a dataset. In this post, we explore how to scale this capability by running several PyCaret training jobs in a distributed manner on Spark or Dask.
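The core pattern is a single function that takes one partition's rows as a pandas DataFrame, runs PyCaret on just that slice, and returns its score grid; Fugue's transform() then maps that function over each group on the chosen backend. Below is a minimal sketch under stated assumptions: PyCaret 2.x (whose setup() accepts silent=True) and Fugue installed with the Dask extra; the "store" grouping column and "label" target are hypothetical stand-ins for your own schema.

import pandas as pd
from pycaret.classification import compare_models, pull, setup
from fugue import transform

# Toy data: two "stores", each of which should get its own models.
data = pd.DataFrame({
    "store": ["A"] * 50 + ["B"] * 50,
    "feature": range(100),
    "label": [0, 1] * 50,
})

def train_per_partition(df: pd.DataFrame) -> pd.DataFrame:
    # Runs once per partition: train several models on this group's rows
    # and return a slice of PyCaret's score grid, tagged with the group key.
    setup(data=df, target="label", silent=True, html=False)
    compare_models(n_select=3)
    scores = pull()[["Model", "Accuracy"]].copy()
    scores["store"] = df["store"].iloc[0]
    return scores

# transform() applies train_per_partition to each "store" group. With
# engine="dask" the groups run in parallel on Dask; passing a live
# SparkSession as the engine runs the identical code on Spark instead.
result = transform(
    data,
    train_per_partition,
    schema="Model:str,Accuracy:double,store:str",
    partition={"by": "store"},
    engine="dask",
)

Because the training function only ever sees plain pandas, the same code runs unchanged locally (engine=None), on Dask, or on Spark; the backend is purely a deployment choice.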

PyCaret Model Score Grid Example

First, we …

Tags: dask, data science, fugue, machine learning, pycaret, scaling, spark
