[P] llm_microlibs: Building blocks for running models in distributed mode under budget constraints
Dec. 9, 2023, 2:47 p.m. | /u/mrobo_5ht2a
Machine Learning www.reddit.com
Documentation is currently lacking; more is coming soon.
Warning: very long post. TLDR: this post answers some questions I had about generating text with full, unquantized Falcon-180B under budget constraints.
# What is the goal?
The goal is to benchmark full, unquantized Falcon-180B. I chose Falcon-180B because it is currently the largest open-source model available. I do not use any optimizations such as speculative decoding, nor any kind of quantization. I benchmark both for small and large …