Best-of-Both-Worlds Algorithms for Partial Monitoring. (arXiv:2207.14550v2 [cs.LG] UPDATED)
Oct. 7, 2022, 1:14 a.m. | Taira Tsuchiya, Shinji Ito, Junya Honda
stat.ML updates on arXiv.org
This study considers the partial monitoring problem with $k$ actions and
$d$ outcomes and provides the first best-of-both-worlds algorithms, whose
regrets are favorably bounded in both the stochastic and adversarial regimes.
In particular, we show that for non-degenerate locally observable games, the
regret is $O(m^2 k^4 \log(T) \log(k_{\Pi} T) / \Delta_{\min})$ in the
stochastic regime and $O(m k^{2/3} \sqrt{T \log(T) \log k_{\Pi}})$ in the
adversarial regime, where $T$ is the number of rounds, $m$ is the maximum
number of distinct observations per …