Strategic Decision-Making in the Presence of Information Asymmetry: Provably Efficient RL with Algorithmic Instruments. (arXiv:2208.11040v1 [stat.ML])

Aug. 24, 2022, 1:12 a.m. | Mengxin Yu, Zhuoran Yang, Jianqing Fan

stat.ML updates on arXiv.org

We study offline reinforcement learning under a novel model called strategic
MDP, which characterizes the strategic interactions between a principal and a
sequence of myopic agents with private types. Due to the bilevel structure and
private types, strategic MDP involves information asymmetry between the
principal and the agents. We focus on the offline RL problem, where the goal is
to learn the optimal policy of the principal concerning a target population of
agents based on a pre-collected dataset that consists …
