Llemma: An Open Language Model For Mathematics
Oct. 17, 2023 | EleutherAI Blog (blog.eleuther.ai)
Today we release Llemma: 7-billion- and 34-billion-parameter language models for mathematics. The Llemma models were initialized with Code Llama weights, then trained on Proof-Pile-2, a 55-billion-token dataset of mathematical and scientific documents. The resulting models show improved mathematical capabilities and can be adapted to downstream tasks through prompting or additional fine-tuning.
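As a minimal sketch of the prompting route, the snippet below loads a Llemma checkpoint with the Hugging Face `transformers` library and generates a solution from a plain math prompt. The model ID `EleutherAI/llemma_7b`, the prompt format, and the generation settings are illustrative assumptions, not specified by this post.

```python
def build_math_prompt(problem: str) -> str:
    """Wrap a math problem in a simple problem/solution prompt.

    The exact format is an assumption; Llemma is a base model, so any
    plain-text continuation-style prompt can work.
    """
    return f"Problem:\n{problem}\n\nSolution:\n"


def solve(problem: str,
          model_id: str = "EleutherAI/llemma_7b",  # assumed Hub ID
          max_new_tokens: int = 256) -> str:
    """Generate a solution by greedy decoding from a Llemma checkpoint."""
    # Lazy import so the lightweight prompt helper above has no heavy deps.
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

    inputs = tokenizer(build_math_prompt(problem), return_tensors="pt")
    inputs = inputs.to(model.device)
    output = model.generate(**inputs,
                            max_new_tokens=max_new_tokens,
                            do_sample=False)
    # Decode only the newly generated tokens, not the echoed prompt.
    new_tokens = output[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)


# Example (requires the model weights to be downloaded):
# print(solve("Compute the derivative of x^2 * sin(x)."))
```

For fine-tuning, the same checkpoint can be passed to any standard causal-LM training loop; nothing about the usage differs from other Code Llama-derived models.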