A general theory for robust clustering via trimmed mean

Feb. 6, 2024, 5:50 a.m. | Soham Jana, Jianqing Fan, Sanjeev Kulkarni

stat.ML updates on arXiv.org

Clustering is a fundamental tool in statistical machine learning when data are heterogeneous. Many recent results focus primarily on optimal mislabeling guarantees when data are distributed around centroids with sub-Gaussian errors. Yet the restrictive sub-Gaussian model is often invalid in practice: many real-world applications exhibit heavy-tailed distributions around the centroids or are exposed to adversarial attacks, and both settings call for robust clustering with a robust, data-driven initialization. In this paper, we introduce a hybrid clustering technique with …
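
The abstract is cut off before the method is described, so the sketch below is not the authors' algorithm. It is only a minimal, one-dimensional illustration of the trimmed mean named in the title, and of why a single corrupted point can pull a plain average far from a centroid. The function name `trimmed_mean`, the `trim_fraction` parameter, and the sample values are all assumptions made for illustration.

```python
from statistics import mean


def trimmed_mean(values, trim_fraction=0.1):
    """Average the values after dropping the smallest and largest
    trim_fraction of them (hypothetical helper, for illustration only)."""
    if not 0 <= trim_fraction < 0.5:
        raise ValueError("trim_fraction must be in [0, 0.5)")
    ordered = sorted(values)
    k = int(len(ordered) * trim_fraction)  # points dropped from each end
    kept = ordered[k:len(ordered) - k]
    return mean(kept)


# Nine made-up points near an assumed centroid at 5.0, plus one gross outlier
# standing in for a heavy-tailed draw or an adversarial corruption.
sample = [4.8, 4.9, 5.0, 5.0, 5.1, 5.1, 5.2, 4.9, 5.0, 500.0]

plain = mean(sample)                               # dragged toward the outlier
robust = trimmed_mean(sample, trim_fraction=0.1)   # outlier is discarded

print(f"plain mean:   {plain:.3f}")
print(f"trimmed mean: {robust:.3f}")
```

With these values the plain mean lands far from 5.0, while the trimmed mean stays close to it, because trimming removes the one extreme point on each side before averaging.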


