June 10, 2024, 4:44 a.m. | Easton K. Huch, Jieru Shi, Madeline R. Abbott, Jessica R. Golbus, Alexander Moreno, Walter H. Dempsey

stat.ML updates on arXiv.org arxiv.org

arXiv:2312.06403v3 Announce Type: replace
Abstract: Mobile health leverages personalized, contextually-tailored interventions optimized through bandit and reinforcement learning algorithms. Despite its promise, challenges like participant heterogeneity, nonstationarity, and nonlinearity in rewards hinder algorithm performance. We propose a robust contextual bandit algorithm, termed "DML-TS-NNR", that simultaneously addresses these challenges via (1) modeling the differential reward with user- and time-specific incidental parameters, (2) network cohesion penalties, and (3) debiased machine learning for flexible estimation of baseline rewards. We establish a high-probability regret bound …

abstract algorithm algorithms arxiv challenges cs.lg dml effects health hinder mixed mixed-effects mobile modeling performance personalized reinforcement reinforcement learning replace robust stat.ml through type via

Senior Clinical Data Scientist

@ Novartis | Home Worker

R&D Senior Data Scientist 1

@ Jotun | Sandefjord

Data Scientist - Corporate Audit, Officer

@ State Street | Toronto, Ontario

Senior Manager, Data Science & Analytics Solutions - Safety

@ Hyundai Motor America | Fountain Valley, CA, US, 92708

Data Science Working Student (all genders)

@ Merck Group | Darmstadt, Hessen, DE, 64293

Senior Data Scientist (m/f/d)

@ BASF | Limburgerhof, DE