all AI news
Order-Optimal Regret in Distributed Kernel Bandits using Uniform Sampling with Shared Randomness
Feb. 21, 2024, 5:42 a.m. | Nikola Pavlovic, Sudeep Salgia, Qing Zhao
cs.LG updates on arXiv.org arxiv.org
Abstract: We consider distributed kernel bandits where $N$ agents aim to collaboratively maximize an unknown reward function that lies in a reproducing kernel Hilbert space. Each agent sequentially queries the function to obtain noisy observations at the query points. Agents can share information through a central server, with the objective of minimizing regret that is accumulating over time $T$ and aggregating over agents. We develop the first algorithm that achieves the optimal regret order (as defined …
abstract agent agents aim arxiv cs.dc cs.lg distributed function information kernel lies queries query randomness sampling space stat.ml through type uniform
More from arxiv.org / cs.LG updates on arXiv.org
Jobs in AI, ML, Big Data
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
Senior Data Engineer
@ Quantexa | Sydney, New South Wales, Australia
Staff Analytics Engineer
@ Warner Bros. Discovery | NY New York 230 Park Avenue South