Finding Outliers in Gaussian Model-Based Clustering
April 8, 2024, 4:45 a.m. | Katharine M. Clark, Paul D. McNicholas
stat.ML updates on arXiv.org arxiv.org
Abstract: Clustering, or unsupervised classification, is a task often plagued by outliers. Yet there is a paucity of work on handling outliers in clustering. Outlier identification algorithms tend to fall into three broad categories: outlier inclusion, outlier trimming, and post hoc outlier identification methods, with the first two often requiring pre-specification of the number of outliers. The fact that sample Mahalanobis distance is beta-distributed is used to derive an approximate distribution for the log-likelihoods of subset …
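The beta-distribution fact mentioned in the abstract can be checked numerically. The following is a minimal sketch, not the authors' method: it assumes the classical result that, for n Gaussian observations in p dimensions, the squared Mahalanobis distance D² computed from the sample mean and the unbiased sample covariance satisfies n·D²/(n−1)² ~ Beta(p/2, (n−p−1)/2). The function name `scaled_mahalanobis` and the simulated data are illustrative assumptions.

```python
import numpy as np

def scaled_mahalanobis(X):
    """Return n/(n-1)^2 * D_i^2 for each row of X (hypothetical helper).

    D_i^2 is the squared Mahalanobis distance of row i from the sample
    mean, using the unbiased sample covariance (divisor n - 1).
    """
    X = np.asarray(X, dtype=float)
    n, p = X.shape
    centered = X - X.mean(axis=0)
    # atleast_2d keeps the p == 1 case as a 1x1 matrix
    S = np.atleast_2d(np.cov(X, rowvar=False))
    # Solve S z = centered^T instead of forming an explicit inverse
    sol = np.linalg.solve(S, centered.T)            # shape (p, n)
    d2 = np.einsum("ij,ji->i", centered, sol)       # row-wise quadratic forms
    return n / (n - 1) ** 2 * d2

# Simulated Gaussian data (an assumption for illustration only)
rng = np.random.default_rng(0)
n, p = 200, 3
X = rng.standard_normal((n, p))
b = scaled_mahalanobis(X)

# Under the assumed result, b follows Beta(p/2, (n - p - 1)/2),
# whose mean is p / (n - 1) and whose support is [0, 1].
print("sample mean of scaled distances:", b.mean())
print("Beta mean p/(n-1):              ", p / (n - 1))
print("range:", b.min(), b.max())
```

The scaled distances always lie in [0, 1], and their average equals p/(n−1) exactly by a trace identity, so matching the mean is only a sanity check. A fuller comparison would test the whole empirical distribution against the Beta reference.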