Jan. 12, 2022, 7:31 p.m. | /u/diepala

Data Science www.reddit.com

I have a model with a categorical variable that is one-hot encoded, and I want to compute the feature importance of that categorical variable as a whole.

Feature importance is defined as the mean absolute value of the SHAP values. So, to get the global importance of a categorical feature, should I just aggregate the values across all the components of its one-hot encoding?
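One way to read "aggregate" here, sketched under assumptions: because SHAP values are additive per sample, the one-hot columns of a feature can be summed within each sample first, and the mean absolute value taken afterwards. The function name `grouped_importance`, the column names, and the numbers below are all hypothetical; the SHAP matrix is made up by hand rather than produced by an explainer.

```python
import numpy as np

def grouped_importance(shap_values, feature_names, groups):
    """Mean |SHAP| per feature group, summing one-hot columns per sample first.

    shap_values: array of shape (n_samples, n_columns)
    feature_names: column names, in the same order as the columns
    groups: dict mapping an original feature name to its encoded column names
    """
    name_to_idx = {name: i for i, name in enumerate(feature_names)}
    importance = {}
    for group_name, columns in groups.items():
        idx = [name_to_idx[c] for c in columns]
        # Sum the contributions of all one-hot components within each sample
        per_sample = shap_values[:, idx].sum(axis=1)
        # Global importance: mean absolute value across samples
        importance[group_name] = float(np.abs(per_sample).mean())
    return importance

# Hand-made SHAP matrix for illustration only (not real model output)
feature_names = ["age", "color_red", "color_blue"]
shap_values = np.array([
    [0.5, 0.2, -0.1],
    [-0.5, -0.3, 0.1],
])
groups = {"age": ["age"], "color": ["color_red", "color_blue"]}

result = grouped_importance(shap_values, feature_names, groups)
print(result)
```

Note that this is not the same as adding up the per-column importances: summing the mean absolute values of `color_red` and `color_blue` separately gives a number that is at least as large (triangle inequality), since opposite-signed contributions within a sample no longer cancel. Which of the two is appropriate depends on what the importance is meant to express.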


