March 29, 2024, 3:39 a.m. | /u/ghosthamlet

Machine Learning www.reddit.com

Post: [https://www.ai21.com/blog/announcing-jamba](https://www.ai21.com/blog/announcing-jamba)



>We are thrilled to announce Jamba, the world’s first production-grade Mamba based model. By enhancing [Mamba](https://arxiv.org/pdf/2312.00752.pdf) Structured State Space model (SSM) technology with elements of the traditional Transformer architecture, Jamba compensates for the inherent limitations of a pure SSM model. Offering a 256K context window, it is already demonstrating remarkable gains in throughput and efficiency—just the beginning of what can be possible with this innovative hybrid architecture. Notably, Jamba outperforms or matches other state-of-the-art models in its size …

