April 25, 2024, 7:43 p.m. | Ali Turfah, Xiaoquan Wen

cs.LG updates on arXiv.org arxiv.org

arXiv:2404.15967v1 Announce Type: cross
Abstract: Cluster analysis is a popular unsupervised learning tool used in many disciplines to identify heterogeneous sub-populations within a sample. However, validating cluster analysis results and determining the number of clusters in a data set remains an outstanding problem. In this work, we present a global criterion called the Distinguishability criterion to quantify the separability of identified clusters and validate inferred cluster configurations. Our computational implementation of the Distinguishability criterion corresponds to the Bayes risk of …

abstract analysis arxiv cluster clustering criterion cs.lg data data set global however identify popular results sample set stat.me stat.ml tool type unsupervised unsupervised learning work

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

DevOps Engineer (Data Team)

@ Reward Gateway | Sofia/Plovdiv