Dec. 18, 2023, 8:05 p.m. | 1littlecoder

1littlecoder www.youtube.com

If you have always dreamt of a world beyond transformers, Mamba is something to look deep into!

🔗 Links 🔗

Mamba-3B-SlimPJ: State-space models rivaling the best Transformer architecture
https://www.together.ai/blog/mamba-3b-slimpj

Mamba: Linear-Time Sequence Modeling with Selective State Spaces https://arxiv.org/pdf/2312.00752.pdf

Mamba Model Weights on Hugging Face Model Hub - https://huggingface.co/state-spaces/mamba-2.8b-slimpj


❤️ If you want to support the channel ❤️
Support here:
Patreon - https://www.patreon.com/1littlecoder/
Ko-Fi - https://ko-fi.com/1littlecoder

🧭 Follow me on 🧭
Twitter - https://twitter.com/1littlecoder
Linkedin - https://www.linkedin.com/in/amrrs/

architecture beyond look mamba something space state strikes support transformer transformer architecture transformers world

Senior Machine Learning Engineer

@ GPTZero | Toronto, Canada

ML/AI Engineer / NLP Expert - Custom LLM Development (x/f/m)

@ HelloBetter | Remote

Doctoral Researcher (m/f/div) in Automated Processing of Bioimages

@ Leibniz Institute for Natural Product Research and Infection Biology (Leibniz-HKI) | Jena

Seeking Developers and Engineers for AI T-Shirt Generator Project

@ Chevon Hicks | Remote

Data Analyst (Salesforce)

@ Lisinski Law Firm | Latin America

Data Analyst

@ Fusemachines | India - Remote