s
April 24, 2024, 2:57 a.m. |

Simon Willison's Weblog simonwillison.net

openelm/README-pretraining.md


Apple released something big three hours ago, and I'm still trying to get my head around exactly what it is.


The parent project is called CoreNet, described as "A library for training deep neural networks". Part of the release is a new LLM called OpenELM, which includes completely open source training code and a large number of published training checkpoint.


I'm linking here to the best documentation I've found of that training data: it looks like the bulk of …

ai apple big code generativeai head library llm llms networks neural networks open source part pretraining project readme release something training

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Consultant Senior Power BI & Azure - CDI - H/F

@ Talan | Lyon, France