all AI news
The Elliptical Potential Lemma for General Distributions with an Application to Linear Thompson Sampling. (arXiv:2102.07987v3 [stat.ML] UPDATED)
Jan. 20, 2022, 2:11 a.m. | Nima Hamidi, Mohsen Bayati
cs.LG updates on arXiv.org arxiv.org
In this note, we introduce a general version of the well-known elliptical
potential lemma that is a widely used technique in the analysis of algorithms
in sequential learning and decision-making problems. We consider a stochastic
linear bandit setting where a decision-maker sequentially chooses among a set
of given actions, observes their noisy rewards, and aims to maximize her
cumulative expected reward over a decision-making horizon. The elliptical
potential lemma is a key tool for quantifying uncertainty in estimating
parameters of …
More from arxiv.org / cs.LG updates on arXiv.org
Jobs in AI, ML, Big Data
Senior Marketing Data Analyst
@ Amazon.com | Amsterdam, North Holland, NLD
Senior Data Analyst
@ MoneyLion | Kuala Lumpur, Kuala Lumpur, Malaysia
Data Management Specialist - Office of the CDO - Chase- Associate
@ JPMorgan Chase & Co. | LONDON, LONDON, United Kingdom
BI Data Analyst
@ Nedbank | Johannesburg, ZA
Head of Data Science and Artificial Intelligence (m/f/d)
@ Project A Ventures | Munich, Germany
Senior Data Scientist - GenAI
@ Roche | Hyderabad RSS