Apache Spark (Pt. 2): MLlib - ML 074

June 2, 2022, 10 a.m. | Ben Wilson, Michael Berk

Adventures in Machine Learning redcircle.com

MLlib is Apache Spark's scalable machine learning library. Today, Ben and Michael discuss the ease of use, performance, algorithms, and utilities included in this library and how to execute the best ML workflow with MLlib.

In this episode...

Why stick with Spark libraries vs. a single node operation?

What algorithms are not in Spark Lib?

What is the min. package set to use for supervised learning?

Modeling and validation

Down-sampling your data

MLlib vs. scikit-learn

Resources

Sponsors

Top End Devs …

algorithms apache apache spark discuss libraries library machine machine learning mllib node performance scalable spark utilities workflow

Visit resource

More from redcircle.com / Adventures in Machine Learning

The Impact of AI Tools on Software Development and Quality Assurance - ML 130 1 day, 20 hours ago | redcircle.com

Harnessing Open Source Contributions in Machine Learning and Quantization - ML 148 2 weeks, 1 day ago | redcircle.com

Adaptive Industry ML: Challenges, Automation, and Model Applications - ML 149 2 weeks, 1 day ago | redcircle.com

Data Platform Innovation: Navigating Challenges and Building a Unified Experience - ML 147 3 weeks, 1 day ago | redcircle.com

The Science-Engineering Blend - ML 146 4 weeks, 1 day ago | redcircle.com

The Impact of Process on Successful Tech Companies - ML 145 1 month ago | redcircle.com

Delivering Scoped Solutions: Lessons in Fixing Production System Issues - ML 144 1 month, 1 week ago | redcircle.com

MLOps 101: Scoping, Latency, Data Curation, and Continuous Model Retraining - ML 143 1 month, 2 weeks ago | redcircle.com

AI Engineer Intern, Agents

@ Occam AI | US

View on ai-jobs.net

AI Research Scientist

@ Vara | Berlin, Germany and Remote

View on ai-jobs.net

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Data Engineer - Takealot Group (Takealot.com | Superbalist.com | Mr D Food)

@ takealot.com | Cape Town

View on ai-jobs.net

View more jobs

all AI news

Apache Spark (Pt. 2): MLlib - ML 074

More from redcircle.com / Adventures in Machine Learning

Jobs in AI, ML, Big Data

AI Engineer Intern, Agents

AI Research Scientist

Data Architect

Data ETL Engineer

Lead GNSS Data Scientist

Data Engineer - Takealot Group (Takealot.com | Superbalist.com | Mr D Food)