Feb. 28, 2024, 5:43 a.m. | Jianmin Chen, Robert H. Aseltine, Fei Wang, Kun Chen

cs.LG updates on arXiv.org arxiv.org

arXiv:2206.09107v2 Announce Type: replace
Abstract: Statistical learning with a large number of rare binary features is commonly encountered in analyzing electronic health records (EHR) data, especially in the modeling of disease onset with prior medical diagnoses and procedures. Dealing with the resulting highly sparse and large-scale binary feature matrix is notoriously challenging as conventional methods may suffer from a lack of power in testing and inconsistency in model fitting while machine learning methods may suffer from the inability of producing …

abstract aggregation arxiv binary cs.lg data disease ehr electronic electronic health records feature features feature selection health logic medical modeling prior records scale stat.ap statistical stat.me stat.ml tree type

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Director, Clinical Data Science

@ Aura | Remote USA

Research Scientist, AI (PhD)

@ Meta | Menlo Park, CA | New York City