Feb. 8, 2024, 10:22 a.m. | /u/Henrie_the_dreamer

Deep Learning www.reddit.com

Hey guys, I just published the developer version of NanoDL, a library for developing transformer models within the JAX/Flax ecosystem, and I would love your feedback!

Key Features of NanoDL include:

* A wide array of blocks and layers, facilitating the creation of customised transformer models from scratch.
* An extensive selection of models like LlaMa2, Mistral, Mixtral, GPT3, GPT4 (inferred), T5, Whisper, ViT, Mixers, GAT, CLIP, and more, catering to a variety of tasks and applications.
* Data-parallel distributed trainers …
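
For anyone wondering what "data-parallel" means in practice, here is a minimal sketch in plain JAX. It is not NanoDL's trainer API (this post doesn't show that), just an illustration under my own assumptions: a toy linear model, a hand-rolled SGD step with a made-up learning rate of 0.1, and hypothetical names like `loss_fn` and `train_step`. Each device gets its own slice of the batch, and gradients are averaged across devices with `jax.lax.pmean`.

```python
import functools

import jax
import jax.numpy as jnp


def loss_fn(params, x, y):
    # Toy linear regression with mean squared error (illustrative model only).
    pred = x @ params["w"] + params["b"]
    return jnp.mean((pred - y) ** 2)


@functools.partial(jax.pmap, axis_name="devices")
def train_step(params, x, y):
    # Each device computes loss and gradients on its own shard of the batch.
    loss, grads = jax.value_and_grad(loss_fn)(params, x, y)
    # Average across devices so every replica applies the same update.
    grads = jax.lax.pmean(grads, axis_name="devices")
    loss = jax.lax.pmean(loss, axis_name="devices")
    # Plain SGD step; the 0.1 learning rate is an arbitrary choice.
    params = jax.tree_util.tree_map(lambda p, g: p - 0.1 * g, params, grads)
    return params, loss


n_dev = jax.local_device_count()  # 1 on a single CPU, still runs fine

# Synthetic data: 8 examples per device, 4 features each.
key = jax.random.PRNGKey(0)
x = jax.random.normal(key, (n_dev * 8, 4))
y = x @ jnp.array([1.0, -2.0, 0.5, 3.0])

# Replicate the parameters: one copy per device along a leading axis.
params = {"w": jnp.zeros(4), "b": jnp.zeros(())}
replicated = jax.tree_util.tree_map(lambda p: jnp.stack([p] * n_dev), params)

# Shard the batch: the leading axis must equal the number of devices.
x_sharded = x.reshape(n_dev, -1, 4)
y_sharded = y.reshape(n_dev, -1)

losses = []
for _ in range(50):
    replicated, loss = train_step(replicated, x_sharded, y_sharded)
    losses.append(float(loss[0]))  # identical on every device after pmean
```

The same script runs unchanged on one device or several, since the batch is reshaped based on `jax.local_device_count()`. A real trainer would add an optimizer library, checkpointing, and so on.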

