How 'sleeper agent' AI assistants can sabotage your code without you realizing

Jan. 16, 2024, 9:30 p.m. | Thomas Claburn

The Register - Software: AI + ML www.theregister.com

Today's safety guardrails won't catch these backdoors, study warns

Analysis AI biz Anthropic has published research showing that large language models (LLMs) can be subverted in a way that safety training doesn't currently address.…

agent ai assistants analysis anthropic assistants code guardrails language language models large language large language models llms research safety study training

Visit resource

More from www.theregister.com / The Register - Software: AI + ML

Dear Stack Overflow denizens, thanks for helping train OpenAI's billion-dollar LLMs 19 hours ago | www.theregister.com

access ai models ai stack billion +20

Semiconductor digital twins to sip $285M from America's CHIPS Act funding pool 23 hours ago | www.theregister.com

act america chips chips act +10

Warren Buffett voices AI fears, likens tech to atom bomb 1 day, 1 hour ago | www.theregister.com

artificial artificial intelligence atom benefits +9

Has Windows 11 really lost marketshare to Windows 10? 1 day, 8 hours ago | www.theregister.com

gap imagine latest lost +7

Some scientists can't stop using AI to write research papers 4 days, 10 hours ago | www.theregister.com

articles chance generative literature +6

Atlassian outsources office drudgery to GenAI agents 4 days, 10 hours ago | www.theregister.com

agents applications atlassian become +14

UK inertia on LLMs and copyright is 'de facto endorsement' 4 days, 11 hours ago | www.theregister.com

build companies copyright creators +9

Microsoft continues multibillion-dollar cloud and AI sprinkle in Malaysia 5 days, 4 hours ago | www.theregister.com

ai infrastructure billion ceo cloud +7

Not a Genius move: Resurrecting war hero Alan Turing as your 'chief AI officer' 5 days, 11 hours ago | www.theregister.com

alan alan turing campaign chatbot +10

Lead Developer (AI)

@ Cere Network | San Francisco, US

View on ai-jobs.net

Research Engineer

@ Allora Labs | Remote

View on ai-jobs.net

Ecosystem Manager

@ Allora Labs | Remote

View on ai-jobs.net

Founding AI Engineer, Agents

@ Occam AI | New York

View on ai-jobs.net

AI Engineer Intern, Agents

@ Occam AI | US

View on ai-jobs.net

AI Research Scientist

@ Vara | Berlin, Germany and Remote

View on ai-jobs.net

View more jobs

all AI news

How 'sleeper agent' AI assistants can sabotage your code without you realizing

Today's safety guardrails won't catch these backdoors, study warns

More from www.theregister.com / The Register - Software: AI + ML

Jobs in AI, ML, Big Data

Lead Developer (AI)

Research Engineer

Ecosystem Manager

Founding AI Engineer, Agents

AI Engineer Intern, Agents

AI Research Scientist