April 18, 2022, 1:11 a.m. | Rustam Mussabayev, Nenad Mladenovic, Bassem Jarboui, Ravil Mussabayev

cs.LG updates on arXiv.org arxiv.org

K-means clustering plays a vital role in data mining. However, its
performance drastically drops when applied to huge amounts of data. We propose
a new heuristic that is built on the basis of regular K-means for faster and
more accurate big data clustering using the "less is more" and MSSC
decomposition approaches. The main advantage of the proposed algorithm is that
it naturally turns the K-means local search into global one through the process
of decomposition of the MSSC problem. …

arxiv big clustering k-means

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Data Engineer

@ Parker | New York City

Sr. Data Analyst | Home Solutions

@ Three Ships | Raleigh or Charlotte, NC