When Not to Trust Language Models: Investigating Effectiveness of Parametric and Non-Parametric Memories

June 6, 2023, 5:31 p.m. | Allen Institute for AI


Presentation of ACL 2023 main conference long paper "When Not to Trust Language Models: Investigating Effectiveness of Parametric and Non-Parametric Memories".

Alex Mallen*, Akari Asai*, Victor Zhong, Rajarshi Das, Daniel Khashabi, Hannaneh Hajishirzi

Despite their impressive performance on diverse tasks, large language models (LMs) still struggle with tasks requiring rich world knowledge, implying the limitations of relying solely on their parameters to encode a wealth of world knowledge. This paper aims to understand LMs' strengths and limitations in memorizing factual …


