Apache Spark (Pt. 2): MLlib - ML 074

June 2, 2022, 10 a.m. | Top End Devs, Ben Wilson, Michael Berk

Adventures in Machine Learning redcircle.com

MLlib is Apache Spark's scalable machine learning library. Today, Ben and Michael discuss the ease of use, performance, algorithms, and utilities included in this library and how to execute the best ML workflow with MLlib.

In this episode...

Why stick with Spark libraries vs. a single node operation?

What algorithms are not in Spark Lib?

What is the min. package set to use for supervised learning?

Modeling and validation

Down-sampling your data

MLlib vs. scikit-learn

Resources

Sponsors

Top End Devs …

apache apache spark ml mllib spark

Visit resource

More from redcircle.com / Adventures in Machine Learning

Data Platform Innovation: Navigating Challenges and Building a Unified Experience - ML 147 1 week ago | redcircle.com

The Science-Engineering Blend - ML 146 2 weeks ago | redcircle.com

The Impact of Process on Successful Tech Companies - ML 145 3 weeks ago | redcircle.com

Delivering Scoped Solutions: Lessons in Fixing Production System Issues - ML 144 4 weeks ago | redcircle.com

MLOps 101: Scoping, Latency, Data Curation, and Continuous Model Retraining - ML 143 1 month ago | redcircle.com

Navigating Authority and Transparency in Organizations - ML 142 1 month, 3 weeks ago | redcircle.com

Evolution of Dlib: Addressing Challenges in Machine Learning and Computer Vision - ML 141 2 months, 1 week ago | redcircle.com

Data Scientist (m/f/x/d)

@ Symanto Research GmbH & Co. KG | Spain, Germany

View on ai-jobs.net

Enterprise Data Architect

@ Pathward | Remote

View on ai-jobs.net

Diagnostic Imaging Information Systems (DIIS) Technologist

@ Nova Scotia Health Authority | Halifax, NS, CA, B3K 6R8

View on ai-jobs.net

Intern Data Scientist - Residual Value Risk Management (f/m/d)

@ BMW Group | Munich, DE

View on ai-jobs.net

Analytics Engineering Manager

@ PlayStation Global | United Kingdom, London

View on ai-jobs.net

Junior Insight Analyst (PR&Comms)

@ Signal AI | Lisbon, Lisbon, Portugal

View on ai-jobs.net

View more jobs

all AI news

Apache Spark (Pt. 2): MLlib - ML 074

More from redcircle.com / Adventures in Machine Learning

Jobs in AI, ML, Big Data

Data Scientist (m/f/x/d)

Enterprise Data Architect

Diagnostic Imaging Information Systems (DIIS) Technologist

Intern Data Scientist - Residual Value Risk Management (f/m/d)

Analytics Engineering Manager

Junior Insight Analyst (PR&Comms)