all AI news
llm.c
April 9, 2024, 3:24 p.m. |
Simon Willison's Weblog simonwillison.net
Andrej Karpathy implements LLM training - initially for GPT-2, other architectures to follow - in just over 1,000 lines of C on top of CUDA. Includes a tutorial about implementing LayerNorm by porting an implementation from Python.
Via @karpathy
ai andrej karpathy andrejkarpathy architectures cuda generativeai gpt gpt-2 implementation llm llms python training tutorial via
More from simonwillison.net / Simon Willison's Weblog
How an empty S3 bucket can make your AWS bill explode
1 day, 20 hours ago |
simonwillison.net
My approach to HTML web components
1 day, 21 hours ago |
simonwillison.net
Jobs in AI, ML, Big Data
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
Risk Management - Machine Learning and Model Delivery Services, Product Associate - Senior Associate-
@ JPMorgan Chase & Co. | Wilmington, DE, United States
Senior ML Engineer (Speech/ASR)
@ ObserveAI | Bengaluru