A general theory for robust clustering via trimmed mean | allainews.com

Feb. 6, 2024, 5:50 a.m. | Soham Jana Jianqing Fan Sanjeev Kulkarni

stat.ML updates on arXiv.org arxiv.org

Clustering is a fundamental tool in statistical machine learning in the presence of heterogeneous data. Many recent results focus primarily on optimal mislabeling guarantees, when data are distributed around centroids with sub-Gaussian errors. Yet, the restrictive sub-Gaussian model is often invalid in practice, since various real-world applications exhibit heavy tail distributions around the centroids or suffer from possible adversarial attacks that call for robust clustering with a robust data-driven initialization. In this paper, we introduce a hybrid clustering technique with …

applications clustering data distributed errors focus general machine machine learning math.st mean practice restrictive robust statistical stat.ml stat.th theory tool via world

More from arxiv.org / stat.ML updates on arXiv.org

Nuisance Function Tuning for Optimal Doubly Robust Estimation 2 days, 20 hours ago | arxiv.org

abstract arxiv convergence function +12

Fast Topological Signal Identification and Persistent Cohomological Cycle Matching 2 days, 20 hours ago | arxiv.org

abstract analysis applications art +20

Neural Networks for Extreme Quantile Regression with an Application to Forecasting of Flood Risk 2 days, 20 hours ago | arxiv.org

abstract application arxiv assessment +17

The High Line: Exact Risk and Learning Rate Curves of Stochastic Adaptive Learning Rate Algorithms 2 days, 20 hours ago | arxiv.org

abstract algorithms arxiv call +15

Comparison of Point Process Learning and its special case Takacs-Fiksel estimation 2 days, 20 hours ago | arxiv.org

abstract arxiv case comparison +14

Algorithmically Designed Artificial Neural Networks (ADANNs): Higher order deep operator learning for parametric partial differential … 3 days, 20 hours ago | arxiv.org

abstract ann architectures article +18

Adaptive posterior concentration rates for sparse high-dimensional linear regression with random design and unknown error … 3 days, 20 hours ago | arxiv.org

abstract analyze arxiv design +13

CHANI: Correlation-based Hawkes Aggregation of Neurons with bio-Inspiration 3 days, 20 hours ago | arxiv.org

abstract aggregation arxiv bio +14

Principled Probabilistic Imaging using Diffusion Models as Plug-and-Play Priors 3 days, 20 hours ago | arxiv.org

abstract arxiv bayesian capability +15

Senior Machine Learning Engineer

@ GPTZero | Toronto, Canada

View on ai-jobs.net

ML/AI Engineer / NLP Expert - Custom LLM Development (x/f/m)

@ HelloBetter | Remote

View on ai-jobs.net

Doctoral Researcher (m/f/div) in Automated Processing of Bioimages

@ Leibniz Institute for Natural Product Research and Infection Biology (Leibniz-HKI) | Jena

View on ai-jobs.net

Seeking Developers and Engineers for AI T-Shirt Generator Project

@ Chevon Hicks | Remote

View on ai-jobs.net

Principal Data Architect - Azure & Big Data

@ MGM Resorts International | Home Office - US, NV

View on ai-jobs.net

GN SONG MT Market Research Data Analyst 11

@ Accenture | Bengaluru, BDC7A

View on ai-jobs.net