Aug. 5, 2022, 1:11 a.m. | Mark Sendak, Gaurav Sirdeshmukh, Timothy Ochoa, Hayley Premo, Linda Tang, Kira Niederhoffer, Sarah Reed, Kaivalya Deshpande, Emily Sterrett, Melissa B

stat.ML updates on arXiv.org arxiv.org

The approaches by which the machine learning and clinical research
communities utilize real world data (RWD), including data captured in the
electronic health record (EHR), vary dramatically. While clinical researchers
cautiously use RWD for clinical investigations, ML for healthcare teams consume
public datasets with minimal scrutiny to develop new algorithms. This study
bridges this gap by developing and validating ML-DQA, a data quality assurance
framework grounded in RWD best practices. The ML-DQA framework is applied to
five ML projects across …

arxiv data data quality development framework healthcare learning machine machine learning ml quality quality assurance validation

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne