June 27, 2024, 9:09 a.m. | /u/SkeeringReal

Machine Learning www.reddit.com

Most work in interpretable ML for LLMs has focused on mechanistic interpretability rather than earlier approaches from the literature, like counterfactuals, case-based reasoning, prototypes, saliency maps, concept-based explanations, etc.

Why do you think that is? My feeling is it's because mech interp is just less computationally intensive to research, so it's the only option people really have with LLMs (where e.g., datasets are too big to do case-based reasoning). The other explanation is that people are just trying to move …
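
To make the cost contrast concrete, here is a minimal sketch (assuming a HuggingFace-style causal LM; `gpt2` is just an illustrative checkpoint) of an input-gradient saliency explanation. It needs one forward and one backward pass per query, independent of training-set size, whereas case-based reasoning has to embed and search the entire training corpus before it can return even a single nearest "case" — which is the scaling problem hinted at above.

```python
# Minimal sketch, not a definitive implementation: gradient-norm saliency
# over input tokens for a causal LM. Model/checkpoint choice is illustrative.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tok = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2")
model.eval()

text = "The capital of France is"
inputs = tok(text, return_tensors="pt")

# One forward + one backward pass per query, regardless of corpus size.
embeds = model.get_input_embeddings()(inputs["input_ids"]).detach().requires_grad_(True)
out = model(inputs_embeds=embeds, attention_mask=inputs["attention_mask"])
target = out.logits[0, -1].max()   # score of the top next-token prediction
target.backward()

# One relevance score per input token (L2 norm of the embedding gradient).
saliency = embeds.grad.norm(dim=-1).squeeze(0)
for token, score in zip(tok.convert_ids_to_tokens(inputs["input_ids"][0]), saliency.tolist()):
    print(f"{token:>12s}  {score:.4f}")

# Case-based reasoning, by contrast, would first need an embedding/index over
# the *entire* training corpus to retrieve nearest training examples -- at
# LLM pretraining scale, building and searching that index is the bottleneck.
```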
