Collaborating with Humans without Human Data (arXiv:2110.08176v2 [cs.LG], updated)

Jan. 10, 2022, 2:10 a.m. | DJ Strouse, Kevin R. McKee, Matt Botvinick, Edward Hughes, Richard Everett

Source: cs.LG updates on arXiv.org

Collaborating with humans requires rapidly adapting to their individual
strengths, weaknesses, and preferences. Unfortunately, most standard
multi-agent reinforcement learning techniques, such as self-play (SP) or
population play (PP), produce agents that overfit to their training partners
and do not generalize well to humans. Alternatively, researchers can collect
human data, train a human model using behavioral cloning, and then use that
model to train "human-aware" agents ("behavioral cloning play", or BCP). While
such an approach can improve the generalization of agents …
