Dec. 9, 2023, 2:47 p.m. | /u/mrobo_5ht2a

Machine Learning www.reddit.com

Link to project: https://github.com/microlib-org/llm_microlibs

Documentation is currently lacking; more is coming soon.

Warning: very long post. TLDR: this post answers some questions I had about generating text with full, unquantized Falcon-180B under budget constraints.

# What is the goal

The goal is to benchmark the full, unquantized Falcon-180B. I chose Falcon-180B because it is currently the biggest open-source model available. I do not use any optimizations such as speculative decoding or any kind of quantization. I benchmark both for small and large …
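
To give a feel for why budget matters here, below is a rough back-of-the-envelope sketch of the memory needed just to hold the unquantized weights. It is not taken from the project. It assumes a nominal 180 billion parameters stored in bfloat16 (2 bytes each), and the 80 GB per-device capacity is a hypothetical figure chosen for illustration. KV cache, activations and framework overhead are ignored, so real requirements are higher.

```python
import math


def weight_memory_gb(n_params: float, bytes_per_param: int) -> float:
    """Memory for the weights alone, in decimal gigabytes (1 GB = 1e9 bytes)."""
    return n_params * bytes_per_param / 1e9


# Assumptions (not from the post): nominal parameter count and bf16 storage.
N_PARAMS = 180e9        # "180B" taken at face value
BYTES_BF16 = 2          # bfloat16 uses 2 bytes per parameter
DEVICE_GB = 80          # hypothetical accelerator memory, for illustration only

weights_gb = weight_memory_gb(N_PARAMS, BYTES_BF16)

# Lower bound on device count: weights only, no KV cache or activations.
min_devices = math.ceil(weights_gb / DEVICE_GB)

print(f"weights: {weights_gb:.0f} GB, at least {min_devices} devices of {DEVICE_GB} GB")
```

Under these assumptions the weights alone come to about 360 GB, which is why running the model without quantization is mostly a question of how to spread it across hardware.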

