Jan. 15, 2024, 7:28 p.m. | Sam Charrington

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence) twimlai.com

Today, we continue our NeurIPS series with Dan Friedman, a PhD student in the Princeton NLP group. In our conversation, we explore his research on mechanistic interpretability for transformer models, specifically his paper, Learning Transformer Programs. The LTP paper proposes modifications to the transformer architecture that allow transformer models to be easily converted into human-readable programs, making them inherently interpretable. We also compare the approach proposed by this research with prior approaches to understanding the models and their …
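To give a flavor of what "human-readable program" means here: the paper builds on the RASP family of sequence-programming primitives, in which attention is expressed as a select step (which positions attend to which) followed by an aggregation step. The sketch below is purely illustrative, not code from the paper; the select and selector_width primitives come from RASP, and the histogram task is a standard RASP example, assumed here for demonstration.

```python
def select(keys, queries, predicate):
    # Attention pattern as an explicit boolean matrix:
    # entry [q][k] is True when the predicate links key k to query q.
    return [[predicate(k, q) for k in keys] for q in queries]

def selector_width(attention):
    # A RASP primitive: for each query position, count how many
    # key positions were selected.
    return [sum(row) for row in attention]

def histogram(tokens):
    # Attend from every position to all positions holding the same
    # token, then count the selected positions to get each token's
    # frequency in the sequence.
    same_token = select(tokens, tokens, lambda k, q: k == q)
    return selector_width(same_token)

# histogram(list("hello")) -> [1, 1, 2, 2, 1]
```

A learned transformer whose weights are constrained to discrete, disentangled choices, as in the paper, can be mapped deterministically onto programs of this form, which is what makes the resulting model inspectable by a human rather than requiring post-hoc probing.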

