Feb. 16, 2024, 5:42 a.m. | Eduardo Ochoa Rivera, Ambuj Tewari

cs.LG updates on arXiv.org arxiv.org

arXiv:2402.09467v1 Announce Type: cross
Abstract: We study a novel pure exploration problem: the $\epsilon$-Thresholding Bandit Problem (TBP) with fixed confidence in stochastic linear bandits. We prove a lower bound for the sample complexity and extend an algorithm designed for Best Arm Identification in the linear case to TBP that is asymptotically optimal.

abstract algorithm arm arxiv case complexity confidence cs.lg exploration identification linear novel prove sample stat.ml stochastic study thresholding type

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US