April 26, 2023, 2:23 p.m. | /u/InjuryNeat7483

Data Science www.reddit.com

I'm building a customer segmentation and want to cluster by customer behavior rather than engagement. Because of this, I plan to use more binary variables than continuous ones, which has led me to K-Prototypes.

If you've done this, what issues, if any, did you run into? Someone told me that using more binary variables can make clustering difficult, but I've yet to find anything online that supports this. Of course, …
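For context on how binary and continuous variables interact here, this is a minimal sketch of the mixed dissimilarity K-Prototypes is usually described with: squared Euclidean distance on the numeric features plus a weight gamma times the number of categorical mismatches. The function name, the example customer, and the gamma value are all hypothetical, chosen only for illustration; this is not taken from any particular library.

```python
def kprototypes_distance(x_num, x_cat, c_num, c_cat, gamma):
    """Mixed dissimilarity between a point and a cluster prototype.

    Numeric part: squared Euclidean distance.
    Categorical part: count of mismatching categories, weighted by gamma.
    """
    numeric = sum((a - b) ** 2 for a, b in zip(x_num, c_num))
    mismatches = sum(1 for a, b in zip(x_cat, c_cat) if a != b)
    return numeric + gamma * mismatches


# Hypothetical customer: one standardized continuous feature, four binary flags.
customer_num, customer_cat = [0.5], [1, 0, 1, 1]
prototype_num, prototype_cat = [-0.5], [1, 1, 0, 1]

# gamma=0.5 is an arbitrary illustrative weight, not a recommended value.
d = kprototypes_distance(customer_num, customer_cat,
                         prototype_num, prototype_cat, gamma=0.5)
print(d)  # 1.0 (numeric) + 0.5 * 2 (two mismatched flags) = 2.0
```

Each binary variable can add at most gamma to the distance, so with many binary variables the total categorical contribution depends heavily on how gamma is chosen relative to the scale of the continuous features.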

