ANACONDA: An Improved Dynamic Regret Algorithm for Adaptive Non-Stationary Dueling Bandits. (arXiv:2210.14322v1 [cs.LG]) | allainews.com

Oct. 27, 2022, 1:13 a.m. | Thomas Kleine Buening, Aadirupa Saha

stat.ML updates on arXiv.org arxiv.org

We study the problem of non-stationary dueling bandits and provide the first
adaptive dynamic regret algorithm for this problem. The only two existing
attempts in this line of work fall short across multiple dimensions, including
pessimistic measures of non-stationary complexity and non-adaptive parameter
tuning that requires knowledge of the number of preference changes. We develop
an elimination-based rescheduling algorithm to overcome these shortcomings and
show a near-optimal $\tilde{O}(\sqrt{S^{\texttt{CW}} T})$ dynamic regret bound,
where $S^{\texttt{CW}}$ is the number of times the …

algorithm anaconda arxiv

More from arxiv.org / stat.ML updates on arXiv.org

Calabi-Yau Four/Five/Six-folds as $\mathbb{P}^n_\textbf{w}$ Hypersurfaces: Machine Learning, Approximation, and Generation 16 hours ago | arxiv.org

abstract approximation arxiv five +17

Bayesian Quantile Regression with Subset Selection: A Posterior Summarization Perspective 16 hours ago | arxiv.org

abstract arxiv bayesian distribution +16

The Projected Covariance Measure for assumption-lean variable significance testing 16 hours ago | arxiv.org

abstract arxiv covariance lean +14

A Heteroskedasticity-Robust Overidentifying Restriction Test with High-Dimensional Covariates 16 hours ago | arxiv.org

abstract arxiv econ.em errors +11

Adjoint Sensitivity Analysis on Multi-Scale Bioprocess Stochastic Reaction Network 16 hours ago | arxiv.org

abstract analysis arxiv challenges +15

Neural Networks Optimized by Genetic Algorithms in Cosmology 16 hours ago | arxiv.org

abstract algorithms applications artificial +14

Seeded graph matching for the correlated Gaussian Wigner model via the projected power method 1 day, 16 hours ago | arxiv.org

abstract agreement arxiv edge +10

Convergence and Complexity Guarantee for Inexact First-order Riemannian Optimization Algorithms 1 day, 16 hours ago | arxiv.org

abstract algorithms analyze arxiv +11

Mixture of partially linear experts 1 day, 16 hours ago | arxiv.org

abstract arxiv benefits computational +9

Lead Developer (AI)

@ Cere Network | San Francisco, US

View on ai-jobs.net

Research Engineer

@ Allora Labs | Remote

View on ai-jobs.net

Ecosystem Manager

@ Allora Labs | Remote

View on ai-jobs.net

Founding AI Engineer, Agents

@ Occam AI | New York

View on ai-jobs.net

AI Engineer Intern, Agents

@ Occam AI | US

View on ai-jobs.net

AI Research Scientist

@ Vara | Berlin, Germany and Remote

View on ai-jobs.net