s
Feb. 2, 2024, 4:11 a.m. |

Simon Willison's Weblog simonwillison.net

Open Language Models (OLMos) and the LLM landscape


OLMo is a newly released LLM from the Allen Institute for AI (AI2) currently available in 7B and 1B parameters, trained on a fully openly published dataset called Dolma.


The model and code are Apache 2, while the data is under the "AI2 ImpACT license".


From the benchmark scores shared here by Nathan Lambert it looks like this may be the highest performing model currently available that was built using a fully …

ai ai2 allen allen institute allen institute for ai apache code data dataset dolma generativeai impact institute landscape language language models llm llms olmo openly parameters

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Reporting & Data Analytics Lead (Sizewell C)

@ EDF | London, GB

Data Analyst

@ Notable | San Mateo, CA