all AI news
Instance-Dependent Confidence and Early Stopping for Reinforcement Learning. (arXiv:2201.08536v1 [stat.ML])
Web: http://arxiv.org/abs/2201.08536
Jan. 24, 2022, 2:10 a.m. | Koulik Khamaru, Eric Xia, Martin J. Wainwright, Michael I. Jordan
cs.LG updates on arXiv.org arxiv.org
Various algorithms for reinforcement learning (RL) exhibit dramatic variation
in their convergence rates as a function of problem structure. Such
problem-dependent behavior is not captured by worst-case analyses and has
accordingly inspired a growing effort in obtaining instance-dependent
guarantees and deriving instance-optimal algorithms for RL problems. This
research has been carried out, however, primarily within the confines of
theory, providing guarantees that explain \textit{ex post} the performance
differences observed. A natural next step is to convert these theoretical
guarantees into …
More from arxiv.org / cs.LG updates on arXiv.org
Latest AI/ML/Big Data Jobs
Data Scientist
@ Fluent, LLC | Boca Raton, Florida, United States
Big Data ETL Engineer
@ Binance.US | Vancouver
Data Scientist / Data Engineer
@ Kin + Carta | Chicago
Data Engineer
@ Craft | Warsaw, Masovian Voivodeship, Poland
Senior Manager, Data Analytics Audit
@ Affirm | Remote US
Data Scientist - Nationwide Opportunities, AWS Professional Services
@ Amazon.com | US, NC, Virtual Location - N Carolina