March 20, 2024, 4:43 a.m. | Hamsa Bastani, Osbert Bastani, Wichinpong Park Sinchaisri

cs.LG updates on arXiv.org arxiv.org

arXiv:2108.08454v5 Announce Type: replace
Abstract: Workers spend a significant amount of time learning how to make good decisions. Evaluating the efficacy of a given decision, however, can be complicated -- e.g., decision outcomes are often long-term and relate to the original decision in complex ways. Surprisingly, even though learning good decision-making strategies is difficult, they can often be expressed in simple and concise forms. Focusing on sequential decision-making, we design a novel machine learning algorithm that is capable of extracting …

abstract arxiv cs.hc cs.lg decision decisions good however human long-term making reinforcement reinforcement learning spend strategies type workers

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne