Structural Estimation of Markov Decision Processes in High-Dimensional State Space with Finite-Time Guarantees

March 4, 2024, 5:42 a.m. | Siliang Zeng, Mingyi Hong, Alfredo Garcia

Source: cs.LG updates on arXiv.org

arXiv:2210.01282v3 Announce Type: replace
Abstract: We consider the task of estimating a structural model of dynamic decisions by a human agent based upon the observable history of implemented actions and visited states. This problem has an inherent nested structure: in the inner problem, an optimal policy for a given reward function is identified while in the outer problem, a measure of fit is maximized. Several approaches have been proposed to alleviate the computational burden of this nested-loop structure, but these …
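
The nested structure described in the abstract can be illustrated with a minimal sketch. Everything concrete here is an assumption made for illustration and is not taken from the paper: the two-state toy MDP, the one-parameter reward `theta * FEATURES[s][a]`, the entropy-regularized (soft) value iteration used as the inner solver, and the grid search used as the outer loop. It shows the naive nested-loop baseline whose computational burden the abstract mentions, not the authors' algorithm.

```python
import math

# Hypothetical toy problem: 2 states, 2 actions, transitions assumed known.
N_STATES, N_ACTIONS, GAMMA = 2, 2, 0.9
# P[s][a] is the distribution over next states after taking action a in state s.
P = [[[0.9, 0.1], [0.2, 0.8]],
     [[0.7, 0.3], [0.1, 0.9]]]
# Illustrative reward parameterization: r(s, a) = theta * FEATURES[s][a].
FEATURES = [[0.0, 1.0], [1.0, 0.0]]


def logsumexp(xs):
    """Numerically stable log(sum(exp(x)))."""
    m = max(xs)
    return m + math.log(sum(math.exp(x - m) for x in xs))


def soft_policy(theta, n_iters=200):
    """Inner problem: solve for the soft-optimal policy under a fixed reward."""
    v = [0.0] * N_STATES
    for _ in range(n_iters):
        q = [[theta * FEATURES[s][a]
              + GAMMA * sum(P[s][a][t] * v[t] for t in range(N_STATES))
              for a in range(N_ACTIONS)]
             for s in range(N_STATES)]
        v = [logsumexp(q[s]) for s in range(N_STATES)]
    # pi(a | s) = exp(q(s, a) - v(s)); each row sums to 1 by construction.
    return [[math.exp(q[s][a] - v[s]) for a in range(N_ACTIONS)]
            for s in range(N_STATES)]


def log_likelihood(theta, data):
    """Measure of fit: log-likelihood of observed (state, action) pairs."""
    pi = soft_policy(theta)  # the inner problem is re-solved for every theta
    return sum(math.log(pi[s][a]) for s, a in data)


def estimate_theta(data, grid):
    """Outer problem: pick the reward parameter that maximizes the fit."""
    return max(grid, key=lambda th: log_likelihood(th, data))


# Made-up observations: the agent mostly picks the high-feature action.
data = [(0, 1)] * 4 + [(1, 0)] * 4 + [(0, 0), (1, 1)]
grid = [0.5 * i for i in range(7)]  # candidate theta values 0.0 .. 3.0
theta_hat = estimate_theta(data, grid)
print("estimated theta:", theta_hat)
```

Each candidate `theta` triggers a full run of `soft_policy`, which is where the cost of the nested loop comes from; with a high-dimensional state space the inner solve alone becomes expensive.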
