Feb. 6, 2024, 5:49 a.m. | Kentaro Hoffman Stephen Salerno Awan Afiaz Jeffrey T. Leek Tyler H. McCormick

cs.LG updates on arXiv.org arxiv.org

As artificial intelligence and machine learning tools become more accessible, and scientists face new obstacles to data collection (e.g. rising costs, declining survey response rates), researchers increasingly use predictions from pre-trained algorithms as outcome variables. Though appealing for financial and logistical reasons, using standard tools for inference can misrepresent the association between independent variables and the outcome of interest when the true, unobserved outcome is replaced by a predicted value. In this paper, we characterize the statistical challenges inherent to …

algorithms artificial artificial intelligence association become collection costs cs.lg data data collection face financial independent inference intelligence machine machine learning obstacles predictions researchers scientists standard stat.me survey tools variables

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Data Scientist

@ Publicis Groupe | New York City, United States

Bigdata Cloud Developer - Spark - Assistant Manager

@ State Street | Hyderabad, India