Sept. 30, 2022, 1:14 a.m. | Xiaoteng Ma, Zhipeng Liang, Jose Blanchet, Mingwen Liu, Li Xia, Jiheng Zhang, Qianchuan Zhao, Zhengyuan Zhou

stat.ML updates on arXiv.org

Among the reasons hindering reinforcement learning (RL) applications to
real-world problems, two factors are critical: limited data and the mismatch
between the testing environment (real environment in which the policy is
deployed) and the training environment (e.g., a simulator). This paper attempts
to address these issues simultaneously with distributionally robust offline RL,
where we learn a distributionally robust policy using historical data obtained
from the source environment by optimizing against a worst-case perturbation
thereof. In particular, we move beyond tabular …
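
The abstract describes the core idea: a robust Bellman backup that evaluates each action against the worst-case transition model in an uncertainty set around the nominal (source-environment) dynamics. The paper itself moves beyond the tabular setting, but a tabular sketch makes the mechanism concrete. The snippet below is a minimal illustration, not the paper's algorithm: it assumes a total-variation uncertainty set of radius `rho` around a nominal transition model `P`, and all names (`P`, `R`, `rho`, `worst_case_expectation`) are illustrative.

```python
# Minimal sketch of distributionally robust value iteration on a small
# tabular MDP with a total-variation uncertainty set (an assumption for
# illustration; the paper considers offline data and function approximation).
import numpy as np
from scipy.optimize import linprog

def worst_case_expectation(p, v, rho):
    """min_{q in simplex, ||q - p||_1 <= rho} q @ v, solved as a small LP."""
    n = len(p)
    # Variables: q (n) and slack t (n) with t >= |q - p|.
    c = np.concatenate([v, np.zeros(n)])
    A_ub = np.block([
        [np.eye(n), -np.eye(n)],      # q - t <= p
        [-np.eye(n), -np.eye(n)],     # -q - t <= -p  (i.e. t >= p - q)
        [np.zeros((1, n)), np.ones((1, n))],  # sum(t) <= rho
    ])
    b_ub = np.concatenate([p, -p, [rho]])
    A_eq = np.concatenate([np.ones((1, n)), np.zeros((1, n))], axis=1)  # sum(q) = 1
    res = linprog(c, A_ub=A_ub, b_ub=b_ub, A_eq=A_eq, b_eq=[1.0],
                  bounds=[(0, None)] * (2 * n), method="highs")
    return res.fun

def robust_value_iteration(P, R, gamma=0.95, rho=0.1, iters=200):
    """P: (S, A, S) nominal transitions, R: (S, A) rewards."""
    S, A, _ = P.shape
    V = np.zeros(S)
    for _ in range(iters):
        Q = np.empty((S, A))
        for s in range(S):
            for a in range(A):
                # Robust Bellman backup: expectation under the worst-case
                # transition kernel within the TV ball around P[s, a].
                Q[s, a] = R[s, a] + gamma * worst_case_expectation(P[s, a], V, rho)
        V = Q.max(axis=1)
    return V, Q.argmax(axis=1)
```

The policy greedy with respect to this robust `Q` hedges against perturbations of the training dynamics, which is the intuition behind optimizing against a worst-case perturbation of the source environment.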

Tags: approximation, arXiv, function, linear, offline, reinforcement learning
