March 5, 2024, 2:45 p.m. | Yubo Zhuang, Xiaohui Chen, Yun Yang, Richard Y. Zhang

cs.LG updates on arXiv.org arxiv.org

arXiv:2305.18436v3 Announce Type: replace-cross
Abstract: $K$-means clustering is a widely used machine learning method for identifying patterns in large datasets. Semidefinite programming (SDP) relaxations have recently been proposed for solving the $K$-means optimization problem that enjoy strong statistical optimality guarantees, but the prohibitive cost of implementing an SDP solver renders these guarantees inaccessible to practical datasets. By contrast, nonnegative matrix factorization (NMF) is a simple clustering algorithm that is widely used by machine learning practitioners, but without a solid statistical …

abstract arxiv clustering cost cs.lg datasets k-means large datasets low machine machine learning math.oc optimization patterns programming solver statistical stat.ml type via

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote