GGUF, the long way around

Feb. 29, 2024, 9:39 p.m.

Simon Willison's Weblog simonwillison.net

Vicki Boykis dives deep into the GGUF format used by llama.cpp, after starting with a detailed description of how PyTorch models work and how they are traditionally persisted using Python pickle.

Pickle led to safetensors, a format that avoids the security problems of downloading and running untrusted pickle files.
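To make the pickle risk concrete, here is a minimal sketch, not taken from the linked article. The `Payload` class is hypothetical, and the callable is deliberately harmless (`operator.add`). Pickle's `__reduce__` hook lets a serialized object name any importable callable, and `pickle.loads` calls it during loading.

```python
import operator
import pickle


class Payload:
    """Hypothetical object that makes pickle call a function on load."""

    def __reduce__(self):
        # pickle stores "call operator.add(40, 2)" instead of object state.
        # An attacker could name any importable callable here, not just a harmless one.
        return (operator.add, (40, 2))


blob = pickle.dumps(Payload())

# Loading the bytes runs the stored call; the caller never asked for that.
result = pickle.loads(blob)
print(result)
```

The loaded value is whatever the stored call returns, not a `Payload` instance. This is why a model file in pickle format has to be trusted like executable code, whereas a format that holds only tensor bytes and descriptive metadata has no such hook.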

Llama.cpp introduced GGML, which popularized 16-bit (as opposed to 32-bit) quantization and bundled metadata and tensor data in a single file.
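As a rough illustration of the 16-bit versus 32-bit trade-off, the sketch below packs the same values at both widths using Python's standard `struct` module. The `weights` list is made-up sample data, and this is not the GGML on-disk layout, only a demonstration of the size and precision difference.

```python
import struct

# Made-up sample values standing in for model weights.
weights = [0.1, -1.5, 3.14159, 0.0]
n = len(weights)

# "f" is IEEE 754 single precision (4 bytes), "e" is half precision (2 bytes).
as_f32 = struct.pack(f"<{n}f", *weights)
as_f16 = struct.pack(f"<{n}e", *weights)

# Round-trip the 16-bit bytes to see how much precision was lost.
restored = struct.unpack(f"<{n}e", as_f16)
max_err = max(abs(a - b) for a, b in zip(weights, restored))

print(len(as_f32), len(as_f16), max_err)
```

The 16-bit encoding takes half the bytes at the cost of a small rounding error per value, which is the trade that makes large models cheaper to store and load.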

GGUF fixed some design flaws …

