May 10, 2023, 2:09 a.m. | /u/2600_yay

Natural Language Processing www.reddit.com

This new piece of research was released today (2023-05-09) by OpenAI: https://openai.com/research/language-models-can-explain-neurons-in-language-models

Using the `neuron-explainer` tool, you can drill down into individual neurons, e.g., here's one for "transition words at the beginning of sentences" like `however`, `additionally`, `since`, etc. https://openaipublic.blob.core.windows.net/neuron-explainer/neuron-viewer/index.html#/layers/15/neurons/4538

There's also an option to submit a better explanation for what the neuron is doing. (OpenAI is crowdsourcing the reinforcement learning human feedback - RLHF - portion of model building, it seems.) Perhaps you think you have a better label …

building click crowdsourcing feedback human human feedback languagetechnology neuron openai reinforcement reinforcement learning rlhf think transition words

Founding AI Engineer, Agents

@ Occam AI | New York

AI Engineer Intern, Agents

@ Occam AI | US

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne