OpenAI's new 'instruction hierarchy' could make AI models harder to fool

April 24, 2024, 11:29 a.m. | Matthias Bastian

OpenAI researchers have proposed an instruction hierarchy for AI language models that is intended to reduce their vulnerability to prompt injection attacks and jailbreaks. Initial results are promising.

The article OpenAI's new 'instruction hierarchy' could make AI models harder to fool appeared first on THE DECODER.
