Parallelize your massive SHAP computations with MLlib and PySpark
June 6, 2022, 8:46 p.m. | Aneesh Bose
Towards Data Science - Medium towardsdatascience.com
A stepwise guide for efficiently explaining your models using SHAP.
Photo by Pietro Jeng on Unsplash
Introduction to MLlib
Apache Spark’s Machine Learning Library (MLlib) is designed primarily for scalability and speed. It leverages the Spark runtime for common distributed use cases: supervised learning such as classification and regression, unsupervised learning such as clustering and collaborative filtering, and other tasks such as dimensionality reduction. In this article, I cover how we can use SHAP to explain a Gradient Boosted Trees (GBT) …
Tags: explainable ai, machine learning, massive, mllib, pyspark, shap, shapley-values, spark-mllib