Apache Spark (Pt. 2): MLlib - ML 074

June 2, 2022, 10 a.m. | Ben Wilson, Michael Berk

Adventures in Machine Learning redcircle.com

MLlib is Apache Spark's scalable machine learning library. Today, Ben and Michael discuss the ease of use, performance, algorithms, and utilities included in this library and how to execute the best ML workflow with MLlib.

In this episode...

Why stick with Spark libraries vs. a single node operation?

What algorithms are not in Spark Lib?

What is the min. package set to use for supervised learning?

Modeling and validation

Down-sampling your data

MLlib vs. scikit-learn

Resources

Sponsors

Top End Devs …

algorithms apache apache spark discuss libraries library machine machine learning mllib ml workflow node performance scalable spark utilities workflow

Visit resource

More from redcircle.com / Adventures in Machine Learning

The Journey to Expertise with Fernando Lopez - ML 152 1 week, 3 days ago | redcircle.com

Unraveling the Complexities of Model Deployment in Dynamic Marketplaces - ML 151 3 weeks, 3 days ago | redcircle.com

The Impact of AI Tools on Software Development and Quality Assurance - ML 130 1 month ago | redcircle.com

Harnessing Open Source Contributions in Machine Learning and Quantization - ML 148 1 month, 2 weeks ago | redcircle.com

Adaptive Industry ML: Challenges, Automation, and Model Applications - ML 149 1 month, 2 weeks ago | redcircle.com

Data Platform Innovation: Navigating Challenges and Building a Unified Experience - ML 147 1 month, 3 weeks ago | redcircle.com

The Science-Engineering Blend - ML 146 1 month, 4 weeks ago | redcircle.com

The Impact of Process on Successful Tech Companies - ML 145 2 months ago | redcircle.com

Delivering Scoped Solutions: Lessons in Fixing Production System Issues - ML 144 2 months, 1 week ago | redcircle.com

Senior Machine Learning Engineer

@ GPTZero | Toronto, Canada

View on ai-jobs.net

ML/AI Engineer / NLP Expert - Custom LLM Development (x/f/m)

@ HelloBetter | Remote

View on ai-jobs.net

Doctoral Researcher (m/f/div) in Automated Processing of Bioimages

@ Leibniz Institute for Natural Product Research and Infection Biology (Leibniz-HKI) | Jena

View on ai-jobs.net

Seeking Developers and Engineers for AI T-Shirt Generator Project

@ Chevon Hicks | Remote

View on ai-jobs.net

Senior Applied Data Scientist

@ dunnhumby | London

View on ai-jobs.net

Principal Data Architect - Azure & Big Data

@ MGM Resorts International | Home Office - US, NV

View on ai-jobs.net

all AI news

Apache Spark (Pt. 2): MLlib - ML 074

More from redcircle.com / Adventures in Machine Learning

Jobs in AI, ML, Big Data

Senior Machine Learning Engineer

ML/AI Engineer / NLP Expert - Custom LLM Development (x/f/m)

Doctoral Researcher (m/f/div) in Automated Processing of Bioimages

Seeking Developers and Engineers for AI T-Shirt Generator Project

Senior Applied Data Scientist

Principal Data Architect - Azure & Big Data