Aug. 2, 2022, 2:11 a.m. | Robi Bhattacharjee, Jacob Imola, Michal Moshkovitz, Sanjoy Dasgupta

cs.LG updates on arXiv.org arxiv.org

We consider online $k$-means clustering where each new point is assigned to
the nearest cluster center, after which the algorithm may update its centers.
The loss incurred is the sum of squared distances from new points to their
assigned cluster centers. The goal over a data stream $X$ is to achieve loss
that is a constant factor of $L(X, OPT_k)$, the best possible loss using $k$
fixed points in hindsight.


We propose a data parameter, $\Lambda(X)$, such that for any …

arxiv clustering data data streams lg

Senior Machine Learning Engineer

@ GPTZero | Toronto, Canada

ML/AI Engineer / NLP Expert - Custom LLM Development (x/f/m)

@ HelloBetter | Remote

Doctoral Researcher (m/f/div) in Automated Processing of Bioimages

@ Leibniz Institute for Natural Product Research and Infection Biology (Leibniz-HKI) | Jena

Seeking Developers and Engineers for AI T-Shirt Generator Project

@ Chevon Hicks | Remote

Senior Applied Data Scientist

@ dunnhumby | London

Principal Data Architect - Azure & Big Data

@ MGM Resorts International | Home Office - US, NV