Optimal scheduling of entropy regulariser for continuous-time linear-quadratic reinforcement learning. (arXiv:2208.04466v2 [cs.LG] UPDATED) | allainews.com

Aug. 12, 2022, 1:11 a.m. | Lukasz Szpruch, Tanut Treetanthiploet, Yufei Zhang

cs.LG updates on arXiv.org arxiv.org

This work uses the entropy-regularised relaxed stochastic control perspective
as a principled framework for designing reinforcement learning (RL) algorithms.
Herein agent interacts with the environment by generating noisy controls
distributed according to the optimal relaxed policy. The noisy policies, on the
one hand, explore the space and hence facilitate learning but, on the other
hand, introduce bias by assigning a positive probability to non-optimal
actions. This exploration-exploitation trade-off is determined by the strength
of entropy regularisation. We study algorithms resulting …

arxiv continuous entropy learning lg linear reinforcement reinforcement learning scheduling time

More from arxiv.org / cs.LG updates on arXiv.org

Stochastic Optimal Control Matching 10 hours ago | arxiv.org

arxiv control cs.lg cs.na +6

Value Approximation for Two-Player General-Sum Differential Games with State Constraints 10 hours ago | arxiv.org

abstract approximation arxiv constraints +20

Can We Edit Multimodal Large Language Models? 10 hours ago | arxiv.org

arxiv cs.ai cs.cl cs.cv +9

XIMAGENET-12: An Explainable AI Benchmark Dataset for Model Robustness Evaluation 10 hours ago | arxiv.org

ai benchmark arxiv benchmark cs.cv +7

Generalized Schr\"odinger Bridge Matching 10 hours ago | arxiv.org

arxiv bridge cs.lg generalized +3

Tight bounds on Pauli channel learning without entanglement 10 hours ago | arxiv.org

abstract algorithms arxiv cs.it +9

Automated mapping of virtual environments with visual predictive coding 10 hours ago | arxiv.org

abstract access algorithms arxiv +28

Confident Feature Ranking 10 hours ago | arxiv.org

abstract arxiv cs.ai cs.lg +14

Integrated Sensing-Communication-Computation for Edge Artificial Intelligence 10 hours ago | arxiv.org

abstract advanced and edge ai artificial +27

Senior Marketing Data Analyst

@ Amazon.com | Amsterdam, North Holland, NLD

View on ai-jobs.net

Senior Data Analyst

@ MoneyLion | Kuala Lumpur, Kuala Lumpur, Malaysia

View on ai-jobs.net

Data Management Specialist - Office of the CDO - Chase- Associate

@ JPMorgan Chase & Co. | LONDON, LONDON, United Kingdom

View on ai-jobs.net

BI Data Analyst

@ Nedbank | Johannesburg, ZA

View on ai-jobs.net

Head of Data Science and Artificial Intelligence (m/f/d)

@ Project A Ventures | Munich, Germany

View on ai-jobs.net

Senior Data Scientist - GenAI

@ Roche | Hyderabad RSS

View on ai-jobs.net