Univariate K-Means Clustering vs. Fixed Cluster Boundaries | allainews.com

April 1, 2024, 1:57 p.m. | /u/bernful

Data Science www.reddit.com

I am attempting to cluster stores based off their sales. I can either do:

1. Univariate K-Means clustering by way of the Ckmeans.1d.dp package in R. This works perfectly fine, only 2 cons are figuring out the upper limit on K, and possibly explainability to the client.
2. Fixed cluster boundaries. In this case, I average the sales of all stores, and create boundaries like: 50% below average, 25% below average, 25% above average, 50% above average. This is …

client cluster clustering cons datascience explainability k-means package sales stores

More from www.reddit.com / Data Science

Do multimodal LLMs use classical OCR text recognition under the hood for interpreting text? 6 hours ago | www.reddit.com

datascience extract features foundational +15

Is there a tutorial to create your own PyTorch Module (Linear), Loss (Least Squares), and … 13 hours ago | www.reddit.com

academic create datascience easy +8

Took a couple years off to travel and do personal projects, while contracting for about … 1 day, 5 hours ago | www.reddit.com

contracting data datascience data scientist +12

Do I need to know How to write algorithms from scratch if I want to … 1 day, 9 hours ago | www.reddit.com

algorithms code data datascience +5

Questions to ask and what to look for when interviewing to gauge the "technical culture" … 1 day, 14 hours ago | www.reddit.com

analyst culture datascience employees +14

Do you have both a ML engineer and a MLOps engineer on your team? If … 1 day, 16 hours ago | www.reddit.com

datascience difference engineer engineering +10

Have Data Scientist Interviews Evolved Over the Last Year? 1 day, 20 hours ago | www.reddit.com

access become change companies +17

Tell me about older individual contributors 2 days, 1 hour ago | www.reddit.com

cap contributors data datascience +6

Pedro Thermo Similarity vs Levenshtain/ OSA/ Jaro/ .. 2 days, 3 hours ago | www.reddit.com

algorithm algorithms alternative datascience +4

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

View on ai-jobs.net

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

View on ai-jobs.net

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

View on ai-jobs.net

Lead Developer (AI)

@ Cere Network | San Francisco, US

View on ai-jobs.net