OpenAI's new 'instruction hierarchy' could make AI models harder to fool

April 24, 2024, 11:29 a.m. | Matthias Bastian

OpenAI researchers propose an instruction hierarchy for AI language models. It is intended to reduce vulnerability to prompt injection attacks and jailbreaks. Initial results are promising.

The article OpenAI's new 'instruction hierarchy' could make AI models harder to fool appeared first on THE DECODER.

ai and safety ai language models ai models ai research article artificial intelligence attacks decoder language language models openai prompt prompt injection prompt injection attacks reduce researchers results the decoder vulnerability

Visit resource

More from the-decoder.com / THE DECODER

PE teacher allegedly used AI voice clone to bully principal out of office 11 hours ago | the-decoder.com

ai and audio ai and society ai-generated voice ai voice +17

Med-Gemini and Meditron: Google and Meta present new LLMs for medicine 12 hours ago | the-decoder.com

ai in medicine ai in practice article artificial intelligence +17

AGI could end humanity in more subtle ways than turning us into paperclips 14 hours ago | the-decoder.com

agi ai and society article artificial +14

Open-source model Prometheus 2 can evaluate other language models nearly as well as GPT-4 14 hours ago | the-decoder.com

ai and science ai in practice ai science article +12

OpenAI prepares its AI safety infrastructure for "advanced AI" 1 day, 10 hours ago | the-decoder.com

advanced advanced ai ai and safety ai in practice +10

Microsoft buys cloud growth in Southeast Asia for nearly $4 billion 1 day, 15 hours ago | the-decoder.com

ai infrastructure ai in practice article artificial intelligence +12

AI specialists frustrated with intense market pressure and "AI hype" 1 day, 15 hours ago | the-decoder.com

ai in practice ai tools amazon article +15

X's latest AI news is both ambitious and a recipe for chaos 1 day, 17 hours ago | the-decoder.com

ai and media ai in practice ai news ai-powered +16

OpenAI introduces personal data removal form as ChatGPT faces criticism over false information 2 days, 7 hours ago | the-decoder.com

ai and society article artificial intelligence chatgpt +11

Founding AI Engineer, Agents

@ Occam AI | New York

View on ai-jobs.net

AI Engineer Intern, Agents

@ Occam AI | US

View on ai-jobs.net

AI Research Scientist

@ Vara | Berlin, Germany and Remote

View on ai-jobs.net

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

View more jobs

all AI news

OpenAI's new 'instruction hierarchy' could make AI models harder to fool

More from the-decoder.com / THE DECODER

Jobs in AI, ML, Big Data

Founding AI Engineer, Agents

AI Engineer Intern, Agents

AI Research Scientist

Data Architect

Data ETL Engineer

Lead GNSS Data Scientist