all AI news
Apache Spark (Pt. 2): MLlib - ML 074
June 2, 2022, 10 a.m. | Ben Wilson, Michael Berk
Adventures in Machine Learning redcircle.com
MLlib is Apache Spark's scalable machine learning library. Today, Ben and Michael discuss the ease of use, performance, algorithms, and utilities included in this library and how to execute the best ML workflow with MLlib.
In this episode...
- Why stick with Spark libraries vs. a single node operation?
- What algorithms are not in Spark Lib?
- What is the min. package set to use for supervised learning?
- Modeling and validation
- Down-sampling your data
- MLlib vs. scikit-learn
- Resources
Sponsors
algorithms apache apache spark discuss libraries library machine machine learning mllib ml workflow node performance scalable spark utilities workflow
More from redcircle.com / Adventures in Machine Learning
The Impact of AI Tools on Software Development and Quality Assurance - ML 130
1 day, 22 hours ago |
redcircle.com
The Science-Engineering Blend - ML 146
4 weeks, 1 day ago |
redcircle.com
Jobs in AI, ML, Big Data
Founding AI Engineer, Agents
@ Occam AI | New York
AI Engineer Intern, Agents
@ Occam AI | US
AI Research Scientist
@ Vara | Berlin, Germany and Remote
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne