DeepSpeed Compression: A composable library for extreme compression and zero-cost quantization

July 20, 2022, 4 p.m. | Alyssa Hughes

Microsoft Research www.microsoft.com

Large-scale models are revolutionizing deep learning and AI research, driving major improvements in language understanding, creative text generation, multilingual translation, and more. But despite their remarkable capabilities, the models’ large size creates latency and cost constraints that hinder the deployment of applications built on top of them. In particular, increased inference time and memory consumption […]

The post DeepSpeed Compression: A composable library for extreme compression and zero-cost quantization appeared first on Microsoft Research.


