June 20, 2022, 1:12 a.m. | Xuchuang Wang, Hong Xie, John C.S. Lui

stat.ML updates on arXiv.org

We generalize the multiple-play multi-armed bandits (MP-MAB) problem to a
shareable-arm setting, in which several plays can share the same arm.
Furthermore, each shareable arm has a finite reward capacity and a "per-load"
reward distribution, both of which are unknown to the learner. The reward from
a shareable arm is load-dependent: it is the per-load reward multiplied by the
number of plays pulling the arm, or by the arm's reward capacity when the
number of plays exceeds that capacity. When …
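
The load-dependent reward described above amounts to a simple rule: a per-load reward draw scaled by the effective load, min(number of plays, capacity). Below is a minimal sketch of that reward model, assuming a Bernoulli per-load distribution (the abstract leaves it unspecified); the function and parameter names are illustrative, not from the paper.

```python
import numpy as np

def shareable_arm_reward(per_load_mean: float, capacity: int, num_plays: int, rng=None):
    """Sample a load-dependent reward for one shareable arm.

    The effective load is min(num_plays, capacity): plays beyond the
    capacity limit contribute nothing. The total reward is a single
    per-load reward draw scaled by that effective load. The Bernoulli
    per-load distribution is an illustrative assumption.
    """
    rng = rng or np.random.default_rng()
    effective_load = min(num_plays, capacity)
    per_load_reward = rng.binomial(1, per_load_mean)  # one per-load draw
    return per_load_reward * effective_load

# Example: 5 plays assigned to an arm with capacity 3 and per-load mean 0.6
# yields reward 0 or 3, since only 3 of the 5 plays are effective.
print(shareable_arm_reward(per_load_mean=0.6, capacity=3, num_plays=5))
```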
