s
Jan. 10, 2024, 5:09 a.m. |

Simon Willison's Weblog simonwillison.net

The Random Transformer


"Understand how transformers work by demystifying all the math behind them" - Omar Sanseviero from Hugging Face meticulously implements the transformer architecture behind LLMs from scratch using Python and numpy. There's a lot to take in here but it's all very clearly explained.


Via Hacker News

ai architecture explained face generativeai hacker hugging face llms math numpy python random them transformer transformer architecture transformers via work

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne