Efficient Online Linear Control with Stochastic Convex Costs and Unknown Dynamics. (arXiv:2203.01170v2 [math.OC] UPDATED) | allainews.com

June 23, 2022, 1:12 a.m. | Asaf Cassel (1), Alon Cohen (2 and 3), Tomer Koren (1 and 3) ((1) School of Computer Science, Tel Aviv University, (2) School of Electrical Engineerin

stat.ML updates on arXiv.org arxiv.org

We consider the problem of controlling an unknown linear dynamical system
under a stochastic convex cost and full feedback of both the state and cost
function. We present a computationally efficient algorithm that attains an
optimal $\sqrt{T}$ regret-rate compared to the best stabilizing linear
controller in hindsight. In contrast to previous work, our algorithm is based
on the Optimism in the Face of Uncertainty paradigm. This results in a
substantially improved computational complexity and a simpler analysis.

arxiv costs dynamics linear math stochastic

More from arxiv.org / stat.ML updates on arXiv.org

Simultaneous upper and lower bounds of American option prices with hedging via neural networks 21 hours ago | arxiv.org

abstract arxiv form math.pr +11

Distributional Preference Learning: Understanding and Accounting for Hidden Context in RLHF 1 day, 21 hours ago | arxiv.org

accounting arxiv context cs.ai +6

Hacking Task Confounder in Meta-Learning 1 day, 21 hours ago | arxiv.org

abstract arxiv cs.lg hacking +12

Reflection coupling for unadjusted generalized Hamiltonian Monte Carlo in the nonconvex stochastic gradient case 1 day, 21 hours ago | arxiv.org

abstract algorithms arxiv case +10

Provable Reward-Agnostic Preference-Based Reinforcement Learning 1 day, 21 hours ago | arxiv.org

abstract agent arxiv cs.ai +16

Mastering Diverse Domains through World Models 1 day, 21 hours ago | arxiv.org

abstract algorithm algorithms application +22

Precise Asymptotics for Spectral Methods in Mixed Generalized Linear Models 1 day, 21 hours ago | arxiv.org

abstract arxiv cs.it cs.lg +14

Additive Covariance Matrix Models: Modelling Regional Electricity Net-Demand in Great Britain 1 day, 21 hours ago | arxiv.org

abstract arxiv britain consumption +18

Learning Algorithm Generalization Error Bounds via Auxiliary Distributions 1 day, 21 hours ago | arxiv.org

abstract algorithm arxiv cs.it +16

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Research Associate (Data Science/Information Engineering/Applied Mathematics/Information Technology)

@ Nanyang Technological University | NTU Main Campus, Singapore

View on ai-jobs.net

Associate Director of Data Science and Analytics

@ Penn State University | Penn State University Park

View on ai-jobs.net

Student Worker- Data Scientist

@ TransUnion | Israel - Tel Aviv

View on ai-jobs.net

Vice President - Customer Segment Analytics Data Science Lead

@ JPMorgan Chase & Co. | Bengaluru, Karnataka, India

View on ai-jobs.net

Middle/Senior Data Engineer

@ Devexperts | Sofia, Bulgaria

View on ai-jobs.net