all AI news
llm.c
April 9, 2024, 3:24 p.m. |
Simon Willison's Weblog simonwillison.net
Andrej Karpathy implements LLM training - initially for GPT-2, other architectures to follow - in just over 1,000 lines of C on top of CUDA. Includes a tutorial about implementing LayerNorm by porting an implementation from Python.
Via @karpathy
ai andrej karpathy andrejkarpathy architectures cuda generativeai gpt gpt-2 implementation llm llms python training tutorial via
More from simonwillison.net / Simon Willison's Weblog
AI counter app from my PyCon US keynote
1 day, 21 hours ago |
simonwillison.net
Understand errors and warnings better with Gemini
2 days, 14 hours ago |
simonwillison.net
Commit: Add a shared credentials relationship from twitter.com to x.com
2 days, 16 hours ago |
simonwillison.net
Jobs in AI, ML, Big Data
Software Engineer for AI Training Data (School Specific)
@ G2i Inc | Remote
Software Engineer for AI Training Data (Python)
@ G2i Inc | Remote
Software Engineer for AI Training Data (Tier 2)
@ G2i Inc | Remote
Data Engineer
@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania
Artificial Intelligence – Bioinformatic Expert
@ University of Texas Medical Branch | Galveston, TX
Lead Developer (AI)
@ Cere Network | San Francisco, US