DeepSpeed: Advancing MoE inference and training to power next-generation AI scale

Jan. 19, 2022, 5:19 p.m. | Alyssa Hughes

Microsoft Research www.microsoft.com

In the last three years, the largest trained dense models have increased in size by over 1,000 times, from a few hundred million parameters to over 500 billion parameters in Megatron-Turing NLG 530B (MT-NLG). Improvements in model quality with size suggest that this trend will continue, with larger model sizes bringing better model quality. However, […]

The post DeepSpeed: Advancing MoE inference and training to power next-generation AI scale appeared first on Microsoft Research.
