Learning in Markov Decision Processes under Constraints. (arXiv:2002.12435v5 [cs.LG] UPDATED) | allainews.com

Jan. 7, 2022, 2:10 a.m. | Rahul Singh, Abhishek Gupta, Ness B. Shroff

cs.LG updates on arXiv.org arxiv.org

We consider reinforcement learning (RL) in Markov Decision Processes in which
an agent repeatedly interacts with an environment that is modeled by a
controlled Markov process. At each time step $t$, it earns a reward, and also
incurs a cost-vector consisting of $M$ costs. We design model-based RL
algorithms that maximize the cumulative reward earned over a time horizon of
$T$ time-steps, while simultaneously ensuring that the average values of the
$M$ cost expenditures are bounded by agent-specified thresholds
$c^{ub}_i,i=1,2,\ldots,M$. …

arxiv decision learning markov processes

More from arxiv.org / cs.LG updates on arXiv.org

Stochastic Optimal Control Matching 1 day, 6 hours ago | arxiv.org

arxiv control cs.lg cs.na +6

Value Approximation for Two-Player General-Sum Differential Games with State Constraints 1 day, 6 hours ago | arxiv.org

abstract approximation arxiv constraints +20

Can We Edit Multimodal Large Language Models? 1 day, 6 hours ago | arxiv.org

arxiv cs.ai cs.cl cs.cv +9

XIMAGENET-12: An Explainable AI Benchmark Dataset for Model Robustness Evaluation 1 day, 6 hours ago | arxiv.org

ai benchmark arxiv benchmark cs.cv +7

Generalized Schr\"odinger Bridge Matching 1 day, 6 hours ago | arxiv.org

arxiv bridge cs.lg generalized +3

Tight bounds on Pauli channel learning without entanglement 1 day, 6 hours ago | arxiv.org

abstract algorithms arxiv cs.it +9

Automated mapping of virtual environments with visual predictive coding 1 day, 6 hours ago | arxiv.org

abstract access algorithms arxiv +28

Confident Feature Ranking 1 day, 6 hours ago | arxiv.org

abstract arxiv cs.ai cs.lg +14

Integrated Sensing-Communication-Computation for Edge Artificial Intelligence 1 day, 6 hours ago | arxiv.org

abstract advanced and edge ai artificial +27

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Data Analyst - Associate

@ JPMorgan Chase & Co. | Mumbai, Maharashtra, India

View on ai-jobs.net

Staff Data Engineer (Data Platform)

@ Coupang | Seoul, South Korea

View on ai-jobs.net

AI/ML Engineering Research Internship

@ Keysight Technologies | Santa Rosa, CA, United States

View on ai-jobs.net

Sr. Director, Head of Data Management and Reporting Execution

@ Biogen | Cambridge, MA, United States

View on ai-jobs.net

Manager, Marketing - Audience Intelligence (Senior Data Analyst)

@ Delivery Hero | Singapore, Singapore

View on ai-jobs.net