Oct. 7, 2022, 1:14 a.m. | Taira Tsuchiya, Shinji Ito, Junya Honda

stat.ML updates on arXiv.org arxiv.org

This study considers the partial monitoring problem with $k$-actions and
$d$-outcomes and provides the first best-of-both-worlds algorithms, whose
regrets are favorably bounded both in the stochastic and adversarial regimes.
In particular, we show that for non-degenerate locally observable games, the
regret is $O(m^2 k^4 \log(T) \log(k_{\Pi} T) / \Delta_{\min})$ in the
stochastic regime and $O(m k^{2/3} \sqrt{T \log(T) \log k_{\Pi}})$ in the
adversarial regime, where $T$ is the number of rounds, $m$ is the maximum
number of distinct observations per …

algorithms arxiv monitoring

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Machine Learning Engineer (m/f/d)

@ StepStone Group | Düsseldorf, Germany

2024 GDIA AI/ML Scientist - Supplemental

@ Ford Motor Company | United States