all AI news
Statistically Optimal K-means Clustering via Nonnegative Low-rank Semidefinite Programming
March 5, 2024, 2:45 p.m. | Yubo Zhuang, Xiaohui Chen, Yun Yang, Richard Y. Zhang
cs.LG updates on arXiv.org arxiv.org
Abstract: $K$-means clustering is a widely used machine learning method for identifying patterns in large datasets. Semidefinite programming (SDP) relaxations have recently been proposed for solving the $K$-means optimization problem that enjoy strong statistical optimality guarantees, but the prohibitive cost of implementing an SDP solver renders these guarantees inaccessible to practical datasets. By contrast, nonnegative matrix factorization (NMF) is a simple clustering algorithm that is widely used by machine learning practitioners, but without a solid statistical …
abstract arxiv clustering cost cs.lg datasets k-means large datasets low machine machine learning math.oc optimization patterns programming solver statistical stat.ml type via
More from arxiv.org / cs.LG updates on arXiv.org
Jobs in AI, ML, Big Data
Lead Developer (AI)
@ Cere Network | San Francisco, US
Research Engineer
@ Allora Labs | Remote
Ecosystem Manager
@ Allora Labs | Remote
Founding AI Engineer, Agents
@ Occam AI | New York
AI Engineer Intern, Agents
@ Occam AI | US
AI Research Scientist
@ Vara | Berlin, Germany and Remote