Structural Estimation of Markov Decision Processes in High-Dimensional State Space with Finite-Time Guarantees | allainews.com

March 4, 2024, 5:42 a.m. | Siliang Zeng, Mingyi Hong, Alfredo Garcia

cs.LG updates on arXiv.org arxiv.org

arXiv:2210.01282v3 Announce Type: replace
Abstract: We consider the task of estimating a structural model of dynamic decisions by a human agent based upon the observable history of implemented actions and visited states. This problem has an inherent nested structure: in the inner problem, an optimal policy for a given reward function is identified while in the outer problem, a measure of fit is maximized. Several approaches have been proposed to alleviate the computational burden of this nested-loop structure, but these …

abstract agent arxiv cs.ai cs.lg decision decisions dynamic econ.em history human markov observable policy processes space state stat.ml type

More from arxiv.org / cs.LG updates on arXiv.org

Marabou 2.0: A Versatile Formal Analyzer of Neural Networks 14 hours ago | arxiv.org

abstract analysis arxiv components +16

Metric Entropy-Free Sample Complexity Bounds for Sample Average Approximation in Convex Stochastic Programming 14 hours ago | arxiv.org

abstract approximation arxiv complexity +15

FengWu-4DVar: Coupling the Data-driven Weather Forecasting Model with 4D Variational Assimilation 14 hours ago | arxiv.org

abstract artificial artificial intelligence arxiv +16

Image Restoration Through Generalized Ornstein-Uhlenbeck Bridge 14 hours ago | arxiv.org

arxiv bridge cs.ai cs.cv +8

Learn or Recall? Revisiting Incremental Learning with Pre-trained Language Models 14 hours ago | arxiv.org

arxiv cs.cl cs.lg incremental +7

System-level Safety Guard: Safe Tracking Control through Uncertain Neural Network Dynamics Models 14 hours ago | arxiv.org

arxiv control cs.lg cs.ro +13

Structured state-space models are deep Wiener models 14 hours ago | arxiv.org

abstract arxiv become classification +16

Differentiable and accelerated spherical harmonic and Wigner transforms 14 hours ago | arxiv.org

abstract analysis and analysis arxiv +16

Stable Attractors for Neural networks classification via Ordinary Differential Equations (SA-nODE) 14 hours ago | arxiv.org

abstract arxiv classification cond-mat.dis-nn +18

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

View on ai-jobs.net

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

View on ai-jobs.net

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

View on ai-jobs.net

Intern - Robotics Industrial Engineer Summer 2024

@ Vitesco Technologies | Seguin, US

View on ai-jobs.net