Jan. 21, 2022, 5:57 a.m. | Matan Weksler

Towards AI - Medium (pub.towardsai.net)

Artificial Intelligence

Last month, DeepMind published its new NLP model, RETRO (Retrieval-Enhanced Transformer), which, according to the paper, is a leap forward for NLP in several respects. Most notably, the model achieves results comparable to SOTA architectures (e.g., GPT-3) while being roughly 25× smaller: only 7.5B parameters, compared to the 178B parameters of AI21's Jurassic-1.

This challenges the common presumption that bigger models necessarily mean better accuracy.

The main advantage of smaller models …
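To make the headline idea concrete, here is a minimal, hypothetical sketch of retrieval enhancement: rather than storing all world knowledge in its weights, the model looks up nearest-neighbour text chunks in an external database and conditions its predictions on them. The `embed`, `database`, and `retrieve` names below are illustrative assumptions, not DeepMind's code; RETRO's actual pipeline uses frozen BERT embeddings over a roughly 2-trillion-token database and feeds the retrieved chunks into the transformer via chunked cross-attention.

```python
# Illustrative sketch only -- NOT DeepMind's implementation.
# The point: a small model plus an external datastore can stand in for
# knowledge that a large model would otherwise memorise in its weights.
import numpy as np

def embed(text: str, dim: int = 64) -> np.ndarray:
    """Stand-in for a frozen embedding model (RETRO uses frozen BERT)."""
    rng = np.random.default_rng(abs(hash(text)) % (2**32))
    v = rng.standard_normal(dim)
    return v / np.linalg.norm(v)  # unit-normalise so dot product = cosine sim

# Hypothetical external datastore of text chunks
# (RETRO's is on the order of 2 trillion tokens).
database = [
    "RETRO retrieves chunks from a large text database.",
    "GPT-3 stores world knowledge in 175B parameters.",
    "Retrieval lets a small model access external knowledge.",
]
db_vectors = np.stack([embed(chunk) for chunk in database])

def retrieve(query: str, k: int = 2) -> list[str]:
    """Return the k nearest-neighbour chunks for a query."""
    sims = db_vectors @ embed(query)       # cosine similarity (unit vectors)
    top = np.argsort(sims)[::-1][:k]       # indices of the k best matches
    return [database[i] for i in top]

# A language model would then attend to these retrieved chunks
# (RETRO does this with chunked cross-attention) instead of having
# to memorise the underlying facts in its own parameters.
print(retrieve("How can small models match large ones?"))
```

Because the retrieval database can be grown or updated independently of the model, the parameter count no longer has to scale with the amount of knowledge the system can draw on.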

deep learning, deepmind, nlp, retrieval, transformers
