Feb. 21, 2022, 2:11 a.m. | Tianning Xu, Ruoqing Zhu, Xiaofeng Shao

cs.LG updates on arXiv.org arxiv.org

Ensemble methods based on subsampling, such as random forests, are popular in
applications due to their high predictive accuracy. Existing literature views a
random forest prediction as an infinite-order incomplete U-statistic to
quantify its uncertainty. However, these methods focus on a small subsampling
size of each tree, which is theoretically valid but practically limited. This
paper develops an unbiased variance estimator based on incomplete U-statistics,
which allows the tree size to be comparable with the overall sample size,
making statistical …

arxiv ml random random forests variance

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Technology Consultant Master Data Management (w/m/d)

@ SAP | Walldorf, DE, 69190

Research Engineer, Computer Vision, Google Research

@ Google | Nairobi, Kenya