Sparse Transformers | A Demo

April 18, 2022, 4:03 p.m. | Ricky Costa

How fast can BERT go with sparsity?

Here’s a Little Secret:

If you want to measure how fast 19 sparse BERT models run inference, all you need is a YAML file and 16 GB of RAM. And spoiler alert:

… they run on CPUs.

… and they’re super fast!

The latest feature from Neural Magic’s DeepSparse repo is the DeepSparse Server! And the objective of this article is to show not only how …
