Aug. 22, 2022, 1:12 a.m. | Junpei Komiyama, Shunya Noda

stat.ML updates on arXiv.org arxiv.org

This paper proposes a new approach to training recommender systems called
deviation-based learning. The recommender and rational users have different
knowledge. The recommender learns user knowledge by observing what action users
take upon receiving recommendations. Learning eventually stalls if the
recommender always suggests a choice: Before the recommender completes
learning, users start following the recommendations blindly, and their choices
do not reflect their knowledge. The learning rate and social welfare improve
substantially if the recommender abstains from recommending a particular …

arxiv deviation learning recommender systems systems training

Lead Developer (AI)

@ Cere Network | San Francisco, US

Research Engineer

@ Allora Labs | Remote

Ecosystem Manager

@ Allora Labs | Remote

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote