Web: http://arxiv.org/abs/2203.01170

June 23, 2022, 1:11 a.m. | Asaf Cassel (1), Alon Cohen (2 and 3), Tomer Koren (1 and 3) ((1) School of Computer Science, Tel Aviv University, (2) School of Electrical Engineerin

cs.LG updates on arXiv.org arxiv.org

We consider the problem of controlling an unknown linear dynamical system
under a stochastic convex cost and full feedback of both the state and the
cost function. We present a computationally efficient algorithm that attains
an optimal $\sqrt{T}$ regret rate relative to the best stabilizing linear
controller in hindsight. In contrast to previous work, our algorithm is based
on the Optimism in the Face of Uncertainty paradigm. This results in
substantially improved computational complexity and a simpler analysis.
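
To make the comparator in the regret definition concrete, here is a minimal sketch of measuring regret against the best stabilizing linear controller in hindsight. It is not the algorithm from the paper. It assumes a scalar system x_{t+1} = a*x_t + b*u_t + w_t, a quadratic cost (the abstract allows general convex costs), and a finite grid of candidate gains. The names `rollout_cost` and `regret_vs_best_gain` are hypothetical and used only for illustration.

```python
import random

def rollout_cost(a, b, k, noise, q=1.0, r=1.0):
    """Total quadratic cost of the linear controller u_t = -k * x_t
    on the scalar system x_{t+1} = a*x_t + b*u_t + w_t (assumed model)."""
    x, total = 0.0, 0.0
    for w in noise:
        u = -k * x
        total += q * x * x + r * u * u  # assumed quadratic cost
        x = a * x + b * u + w
    return total

def regret_vs_best_gain(a, b, k_played, candidate_gains, noise):
    """Cost of the played gain minus the cost of the best stabilizing
    candidate gain, both evaluated on the same noise sequence."""
    # A gain k is stabilizing for the scalar system when |a - b*k| < 1.
    stabilizing = [k for k in candidate_gains if abs(a - b * k) < 1.0]
    best = min(rollout_cost(a, b, k, noise) for k in stabilizing)
    return rollout_cost(a, b, k_played, noise) - best

rng = random.Random(0)
noise = [rng.gauss(0.0, 1.0) for _ in range(1000)]
gains = [i / 20 for i in range(41)]  # hypothetical grid: 0.0, 0.05, ..., 2.0
reg = regret_vs_best_gain(a=1.2, b=1.0, k_played=0.5,
                          candidate_gains=gains, noise=noise)
print(f"regret of k=0.5 over {len(noise)} steps: {reg:.2f}")
```

In this sketch the played controller is itself a fixed gain taken from the grid, so the regret is nonnegative by construction. A learning algorithm of the kind the abstract describes would instead adapt its control as it estimates the unknown system, and the claim is that its regret then grows at a $\sqrt{T}$ rate.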

