April 2, 2024, 7:50 p.m. | Leda Wang, Zhixiang Zhang, Edgar Dobriban

stat.ML updates on arXiv.org arxiv.org

arXiv:2404.00912v1 Announce Type: cross
Abstract: Randomized algorithms can be used to speed up the analysis of large datasets. In this paper, we develop a unified methodology for statistical inference via randomized sketching or projections in two of the most fundamental problems in multivariate statistical analysis: least squares and PCA. The methodology applies to fixed datasets -- i.e., is data-conditional -- and the only randomness is due to the randomized algorithm. We propose statistical inference methods for a broad range of …

abstract algorithms analysis arxiv datasets forms inference large datasets least math.st methodology multivariate normality paper pca speed squares stat.co statistical stat.me stat.ml stat.th type via

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Senior Principal, Product Strategy Operations, Cloud Data Analytics

@ Google | Sunnyvale, CA, USA; Austin, TX, USA

Data Scientist - HR BU

@ ServiceNow | Hyderabad, India