[Research] "Language models can explain neurons in language models" from OpenAI | allainews.com

May 10, 2023, 2:09 a.m. | /u/2600_yay

Natural Language Processing www.reddit.com

This new piece of research was released today (2023-05-09) by OpenAI: https://openai.com/research/language-models-can-explain-neurons-in-language-models

Using the `neuron-explainer` tool, you can drill down into individual neurons, e.g., here's one for "transition words at the beginning of sentences" like `however`, `additionally`, `since`, etc. https://openaipublic.blob.core.windows.net/neuron-explainer/neuron-viewer/index.html#/layers/15/neurons/4538

There's also an option to submit a better explanation for what the neuron is doing. (OpenAI is crowdsourcing the reinforcement learning human feedback - RLHF - portion of model building, it seems.) Perhaps you think you have a better label …

building click crowdsourcing feedback human human feedback languagetechnology neuron openai reinforcement reinforcement learning rlhf think transition words

More from www.reddit.com / Natural Language Processing

Which NLP-master programs in Europe are more cs-leaning? 17 hours ago | www.reddit.com

computational english europe germany +12

What do you think is the state of the art technique for matching a piece … 2 days, 15 hours ago | www.reddit.com

art city database example +9

Multilabel text classification on unlabled data 3 days, 5 hours ago | www.reddit.com

classification data finance isn +11

I made a text-game where all the LLMs trick each other pretending to be humans. … 3 days, 20 hours ago | www.reddit.com

game humans languagetechnology llms +3

Help with fraud recognition 4 days, 3 hours ago | www.reddit.com

bank code country detection +7

AI-proof language-related jobs in the United States? 5 days, 10 hours ago | www.reddit.com

jobs language languagetechnology management +4

Leveling up RAG 5 days, 19 hours ago | www.reddit.com

advanced advice cleaning context +8

Did we just receive an AI-generated meta-review? 1 week, 1 day ago | www.reddit.com

generated languagetechnology meta review

Found a Way to Keep Transcripts Going 24/7 1 week, 1 day ago | www.reddit.com

apple apple silicon bugs check +10

Founding AI Engineer, Agents

@ Occam AI | New York

View on ai-jobs.net

AI Engineer Intern, Agents

@ Occam AI | US

View on ai-jobs.net

AI Research Scientist

@ Vara | Berlin, Germany and Remote

View on ai-jobs.net

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net