April 7, 2022, 12:24 p.m. | /u/eternalmathstudent

Data Science www.reddit.com

For instance, we use Bernoulli when only the presence or absence of the word matters and frequencies are irrelevant and go with Multinomial when even the frequencies of the words are to be taken in to consideration. But my main question is, how to choose the right variant of NB in other non-text classification problems? For instance, in this mushroom classification [https://www.kaggle.com/datasets/uciml/mushroom-classification](https://www.kaggle.com/datasets/uciml/mushroom-classification) problem all the features are categorical? How should I transform this data for NB? Should I use get\_dummies …

bayes classification context datascience learning multinomial text text classification

Senior Machine Learning Engineer

@ GPTZero | Toronto, Canada

ML/AI Engineer / NLP Expert - Custom LLM Development (x/f/m)

@ HelloBetter | Remote

Doctoral Researcher (m/f/div) in Automated Processing of Bioimages

@ Leibniz Institute for Natural Product Research and Infection Biology (Leibniz-HKI) | Jena

Seeking Developers and Engineers for AI T-Shirt Generator Project

@ Chevon Hicks | Remote

Principal Data Architect - Azure & Big Data

@ MGM Resorts International | Home Office - US, NV

GN SONG MT Market Research Data Analyst 11

@ Accenture | Bengaluru, BDC7A