all AI news
[P] char-mamba: Simple Mamba-based Character-level Language Modeling
March 30, 2024, 12:50 p.m. | /u/necrashter
Machine Learning www.reddit.com
[GitHub Repository](https://github.com/necrashter/char-mamba)
Any plain text file can be used as a dataset. By default, it will automatically download and use the [Tiny Shakespeare dataset](https://raw.githubusercontent.com/karpathy/char-rnn/master/data/tinyshakespeare/input.txt).
Since the code is quite simple, it can be also used as a **template for training Mamba models from scratch**, applicable to a wide array of sequence-to-sequence problems.
I hope …
array code machinelearning mamba scratch simple template training will
More from www.reddit.com / Machine Learning
Jobs in AI, ML, Big Data
Lead Developer (AI)
@ Cere Network | San Francisco, US
Research Engineer
@ Allora Labs | Remote
Ecosystem Manager
@ Allora Labs | Remote
Founding AI Engineer, Agents
@ Occam AI | New York
AI Engineer Intern, Agents
@ Occam AI | US
AI Research Scientist
@ Vara | Berlin, Germany and Remote