A Penalized Shared-parameter Algorithm for Estimating Optimal Dynamic Treatment Regimens (arXiv:2107.07875v2 [stat.ML])

May 27, 2022, 1:11 a.m. | Palash Ghosh, Trikay Nalamada, Shruti Agarwal, Maria Jahja, Bibhas Chakraborty

Source: stat.ML updates on arXiv.org

A dynamic treatment regimen (DTR) is a set of decision rules to personalize
treatments for an individual using their medical history. The Q-learning based
Q-shared algorithm has been used to develop DTRs that involve decision rules
shared across multiple stages of intervention. We show that the existing
Q-shared algorithm can suffer from non-convergence due to the use of linear
models in the Q-learning setup, and identify the condition in which Q-shared
fails. Leveraging properties from expansion-constrained ordinary least-squares,
we give …
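
For readers unfamiliar with the setup, the following is a minimal sketch of standard two-stage Q-learning with linear models, the building block the abstract refers to. It is not the Q-shared algorithm or the penalized variant proposed in the paper: each stage gets its own coefficient vector, with no parameters shared across stages. The simulated data, the feature layout [1, h, a, a*h], and all function names are assumptions made for illustration only.

```python
import numpy as np

def q_features(history, action):
    """Design matrix [1, h, a, a*h] for scalar history h and action a in {-1, +1}."""
    return np.column_stack([np.ones_like(history), history, action, action * history])

def fit_linear_q(features, target):
    """Ordinary least-squares fit of a linear Q-function."""
    coef, *_ = np.linalg.lstsq(features, target, rcond=None)
    return coef

def two_stage_q_learning(h1, a1, h2, a2, y):
    """Backward induction: fit stage 2 first, then stage 1 on a pseudo-outcome."""
    beta2 = fit_linear_q(q_features(h2, a2), y)
    # Pseudo-outcome: fitted stage-2 Q-value under the better of the two actions.
    q2_plus = q_features(h2, np.ones_like(h2)) @ beta2
    q2_minus = q_features(h2, -np.ones_like(h2)) @ beta2
    pseudo_outcome = np.maximum(q2_plus, q2_minus)
    beta1 = fit_linear_q(q_features(h1, a1), pseudo_outcome)
    return beta1, beta2

def decision_rule(beta, history):
    """Pick the action in {-1, +1} that maximises the fitted linear Q-function."""
    # Only the terms multiplied by the action matter: a * (beta[2] + beta[3] * h).
    return np.where(beta[2] + beta[3] * history >= 0, 1.0, -1.0)

# Hypothetical simulated trial: randomised actions, outcome observed after stage 2.
rng = np.random.default_rng(0)
n = 2000
h1 = rng.normal(size=n)
a1 = rng.choice([-1.0, 1.0], size=n)
h2 = 0.5 * h1 + rng.normal(size=n)
a2 = rng.choice([-1.0, 1.0], size=n)
# By construction, the best stage-2 action is the sign of h2.
y = h2 + a2 * h2 + rng.normal(scale=0.1, size=n)

beta1, beta2 = two_stage_q_learning(h1, a1, h2, a2, y)
stage2_actions = decision_rule(beta2, np.array([2.0, -2.0]))
```

In this toy setup the stage-2 model is correctly specified, so the fitted stage-2 rule should recover the sign-of-history rule used to generate the data. A shared-parameter method would instead constrain some coefficients to be equal across stages, which this sketch does not attempt.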


