FlashConv: Speeding up State Space Models
Jan. 23, 2023, 7:11 p.m. | Dan Fu and Tri Dao
Blog Content - TOGETHER www.together.xyz
FlashConv is a technique for speeding up state space models (SSMs). It
enables training SSM-based language models of up to 2.7B parameters (with
almost no attention) and running inference 1.6X faster than Transformers.