all AI news
Learning the Pareto Front Using Bootstrapped Observation Samples
May 24, 2024, 4:47 a.m. | Wonyoung Kim, Garud Iyengar, Assaf Zeevi
cs.LG updates on arXiv.org arxiv.org
Abstract: We consider Pareto front identification (PFI) for linear bandits (PFILin), i.e., the goal is to identify a set of arms with undominated mean reward vectors when the mean reward vector is a linear function of the context. PFILin includes the best arm identification problem and multi-objective active learning as special cases. The sample complexity of our proposed algorithm is optimal up to a logarithmic factor. In addition, the regret incurred by our algorithm during the …
abstract arm arxiv context cs.lg front function identification identify linear mean multi-objective observation pareto pfi replace samples set stat.ml type vector vectors
More from arxiv.org / cs.LG updates on arXiv.org
Jobs in AI, ML, Big Data
Senior Data Engineer
@ Displate | Warsaw
Senior Robotics Engineer - Applications
@ Vention | Montréal, QC, Canada
Senior Application Security Engineer, SHINE - Security Hub for Innovation and Efficiency
@ Amazon.com | Toronto, Ontario, CAN
Simulation Scientist , WWDE Simulation
@ Amazon.com | Bellevue, Washington, USA
Giáo Viên Steam
@ Việc Làm Giáo Dục | Da Nang, Da Nang, Vietnam
Senior Simulation Developer
@ Vention | Montréal, QC, Canada