all AI news
Clustering with missing data: which equivalent for Rubin's rules?. (arXiv:2011.13694v2 [stat.ME] UPDATED)
May 16, 2022, 1:10 a.m. | Vincent Audigier, Ndèye Niang
stat.ML updates on arXiv.org arxiv.org
Multiple imputation (MI) is a popular method for dealing with missing values.
However, the suitable way for applying clustering after MI remains unclear: how
to pool partitions? How to assess the clustering instability when data are
incomplete? By answering both questions, this paper proposed a complete view of
clustering with missing data using MI. The problem of partitions pooling is
here addressed using consensus clustering while, based on the bootstrap theory,
we explain how to assess the instability related to …
More from arxiv.org / stat.ML updates on arXiv.org
Jobs in AI, ML, Big Data
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
Social Insights & Data Analyst (Freelance)
@ Media.Monks | Jakarta
Cloud Data Engineer
@ Arkatechture | Portland, ME, USA