Dec. 18, 2023, 8:05 p.m. | 1littlecoder

1littlecoder www.youtube.com

If you have always dreamt of a world beyond transformers, Mamba is worth a deep look!

🔗 Links 🔗

Mamba-3B-SlimPJ: State-space models rivaling the best Transformer architecture
https://www.together.ai/blog/mamba-3b-slimpj

Mamba: Linear-Time Sequence Modeling with Selective State Spaces https://arxiv.org/pdf/2312.00752.pdf

Mamba Model Weights on Hugging Face Model Hub - https://huggingface.co/state-spaces/mamba-2.8b-slimpj


❤️ If you want to support the channel ❤️
Support here:
Patreon - https://www.patreon.com/1littlecoder/
Ko-Fi - https://ko-fi.com/1littlecoder

🧭 Follow me on 🧭
Twitter - https://twitter.com/1littlecoder
LinkedIn - https://www.linkedin.com/in/amrrs/

