FlashConv: Speeding up State Space Models

Jan. 23, 2023, 7:11 p.m. | Dan Fu and Tri Dao

Blog Content - TOGETHER www.together.xyz

Introducing FlashConv, a new technique for speeding up state space models
(SSMs) that enables training SSM-based language models of up to 2.7B
parameters (with almost no attention) and running inference 1.6X faster than
Transformers.
