Optimistic MLE -- A Generic Model-based Algorithm for Partially Observable Sequential Decision Making. (arXiv:2209.14997v3 [cs.LG] UPDATED) | allainews.com

Nov. 24, 2022, 7:14 a.m. | Qinghua Liu, Praneeth Netrapalli, Csaba Szepesvári, Chi Jin

stat.ML updates on arXiv.org arxiv.org

This paper introduces a simple efficient learning algorithms for general
sequential decision making. The algorithm combines Optimism for exploration
with Maximum Likelihood Estimation for model estimation, which is thus named
OMLE. We prove that OMLE learns the near-optimal policies of an enormously rich
class of sequential decision making problems in a polynomial number of samples.
This rich class includes not only a majority of known tractable model-based
Reinforcement Learning (RL) problems (such as tabular MDPs, factored MDPs, low
witness rank …

algorithm arxiv decision decision making making mle observable

More from arxiv.org / stat.ML updates on arXiv.org

Estimation Sample Complexity of a Class of Nonlinear Continuous-time Systems 1 day ago | arxiv.org

abstract arxiv class complexity +14

Estimation and Uniform Inference in Sparse High-Dimensional Additive Models 1 day ago | arxiv.org

abstract arxiv confidence construct +9

GIST: Gibbs self-tuning for locally adaptive Hamiltonian Monte Carlo 1 day ago | arxiv.org

abstract algorithm arxiv framework +13

Variational Bayesian surrogate modelling with application to robust design optimisation 1 day ago | arxiv.org

abstract application approximation arxiv +20

Corrected generalized cross-validation for finite ensembles of penalized estimators 2 days ago | arxiv.org

abstract arxiv error freedom +13

Statistical Inference for Heterogeneous Treatment Effects Discovered by Generic Machine Learning in Randomized Experiments 2 days ago | arxiv.org

abstract algorithms arxiv causal +15

Asymptotic Validity and Finite-Sample Properties of Approximate Randomization Tests 2 days ago | arxiv.org

abstract arxiv data distribution +11

Preserving linear invariants in ensemble filtering methods 2 days ago | arxiv.org

abstract arxiv ensemble errors +13

Prediction of flow and elastic stresses in a viscoelastic turbulent channel flow using convolutional neural … 2 days ago | arxiv.org

abstract arxiv convolutional neural networks data +12

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Data Engineer

@ Parker | New York City

View on ai-jobs.net

Sr. Data Analyst | Home Solutions

@ Three Ships | Raleigh or Charlotte, NC

View on ai-jobs.net