Feb. 8, 2024, 10:22 a.m. | /u/Henrie_the_dreamer

Deep Learning www.reddit.com

Hey guys, I just published the developer version of NanoDL, a library for developing transformer models within the JAX/Flax ecosystem, and I would love your feedback!

Key Features of NanoDL include:

* A wide array of blocks and layers, facilitating the creation of customised transformer models from scratch.
* An extensive selection of models such as Llama 2, Mistral, Mixtral, GPT-3, GPT-4 (inferred), T5, Whisper, ViT, Mixers, GAT, CLIP, and more, covering a variety of tasks and applications.
* Data-parallel distributed trainers …
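
For anyone unfamiliar with the data-parallel idea mentioned in the last bullet, here is a minimal sketch in plain Python. It is not NanoDL code and does not use its API. The toy model `y = w * x` and the helper names `shard`, `local_gradient`, and `data_parallel_step` are all made up for illustration. Each "worker" computes a gradient on its own slice of the batch, the gradients are averaged, and every replica applies the same update.

```python
# Toy data-parallel SGD in plain Python (illustrative only; not NanoDL code).

def shard(batch, num_workers):
    """Split a batch into num_workers equally sized shards."""
    size = len(batch) // num_workers
    return [batch[i * size:(i + 1) * size] for i in range(num_workers)]


def local_gradient(w, shard_data):
    """Gradient of the mean squared error for y = w * x on one shard."""
    return sum(2.0 * (w * x - y) * x for x, y in shard_data) / len(shard_data)


def data_parallel_step(w, batch, num_workers, lr):
    """One step: per-shard gradients, averaged, then a shared update."""
    grads = [local_gradient(w, s) for s in shard(batch, num_workers)]
    mean_grad = sum(grads) / len(grads)  # stands in for an all-reduce mean
    return w - lr * mean_grad


# Synthetic data following y = 3x, split across 4 simulated workers.
batch = [(float(x), 3.0 * x) for x in range(1, 9)]
w = 0.0
for _ in range(200):
    w = data_parallel_step(w, batch, num_workers=4, lr=0.01)
```

With equally sized shards, the averaged gradient equals the full-batch gradient, so the result matches single-device training. In JAX this pattern is commonly written with `jax.pmap` and `jax.lax.pmean`. NanoDL's trainers may well be implemented differently.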
