Feb. 27, 2024, 5:43 a.m. | Masahiro Kato, Kaito Ariu

cs.LG updates on arXiv.org arxiv.org

arXiv:2106.14077v3 Announce Type: replace
Abstract: We study the best-arm identification problem with fixed confidence when contextual (covariate) information is available in stochastic bandits. Although we can use contextual information in each round, we are interested in the marginalized mean reward over the contextual distribution. Our goal is to identify the best arm with a minimal number of samplings under a given value of the error rate. We show the instance-specific sample complexity lower bounds for the problem. Then, we propose …

abstract arm arxiv confidence cs.lg distribution econ.em identification identify information math.st mean role stat.me stat.ml stat.th stochastic study type

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Risk Models Methodology & IRB, Student in Nordea

@ Nordea | Stockholm, SE, 111 46