137 - Nearest Neighbor Language Modeling and Machine Translation, with Urvashi Khandelwal

Jan. 13, 2023, 10:59 p.m. | Allen Institute for Artificial Intelligence

NLP Highlights allenai.org

We invited Urvashi Khandelwal, a research scientist at Google Brain, to talk about nearest neighbor language models and machine translation models. These models interpolate a parametric (conditional) language model with a non-parametric distribution over the nearest neighbors retrieved from a datastore built from relevant data. These models have been shown to outperform standard parametric language models, and they also have important implications for memorization and generalization in language models.
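The interpolation described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the implementation from the papers: it assumes a tiny in-memory datastore of (key vector, next-token id) pairs, squared L2 distance, a brute-force neighbor search, and a fixed interpolation weight `lam`. The function names `knn_distribution` and `interpolate` are hypothetical.

```python
import math


def knn_distribution(query, datastore, k, vocab_size):
    """Build a next-token distribution from the k nearest datastore entries.

    datastore is a non-empty list of (key_vector, token_id) pairs (an assumed
    layout). Each retrieved neighbor gets weight exp(-distance), and weights
    are summed per token id, then normalized.
    """
    # Squared L2 distance from the query to every key, nearest first.
    scored = sorted(
        (sum((q - x) ** 2 for q, x in zip(query, key)), token)
        for key, token in datastore
    )[:k]
    # Subtract the smallest distance before exponentiating for stability.
    d_min = scored[0][0]
    weights = [math.exp(-(d - d_min)) for d, _ in scored]
    total = sum(weights)
    probs = [0.0] * vocab_size
    for w, (_, token) in zip(weights, scored):
        probs[token] += w / total
    return probs


def interpolate(p_lm, p_knn, lam):
    """Mix the two distributions: lam * p_knn + (1 - lam) * p_lm."""
    return [lam * pk + (1.0 - lam) * pl for pl, pk in zip(p_lm, p_knn)]


# Toy example: 3-token vocabulary, 2-dimensional context vectors.
datastore = [
    ([0.0, 0.0], 1),
    ([1.0, 0.0], 2),
    ([0.0, 0.1], 1),
    ([5.0, 5.0], 0),
]
p_lm = [0.5, 0.2, 0.3]  # stand-in for the parametric model's output
p_knn = knn_distribution([0.0, 0.0], datastore, k=3, vocab_size=3)
p_final = interpolate(p_lm, p_knn, lam=0.25)
print(p_final)
```

In this toy setup, two of the three nearest neighbors store token 1, so the mixed distribution assigns token 1 more probability than the parametric model alone did. At realistic datastore sizes the brute-force sort would be replaced by an approximate nearest neighbor index.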

Urvashi's webpage: https://urvashik.github.io
Papers discussed:
1) Generalization through memorization: Nearest Neighbor Language Models …


