Anthropic researchers wear down AI ethics with repeated questions | allainews.com

April 2, 2024, 8:33 p.m. | Devin Coldewey

TechCrunch techcrunch.com

How do you get an AI to answer a question it’s not supposed to? There are many such “jailbreak” techniques, and Anthropic researchers just found a new one, in which a large language model can be convinced to tell you how to build a bomb if you prime it with a few dozen less-harmful questions […]

© 2024 TechCrunch. All rights reserved. For personal use only.

ai ai ethics anthropic build ethics found jailbreak language language model large language large language model prime question questions researchers

More from techcrunch.com / TechCrunch

Buymeacoffee’s founder has built an AI-powered voice note app 6 hours ago | techcrunch.com

ai ai-powered app apps +18

Google partners with Airtel to offer cloud and genAI products to Indian businesses 6 hours ago | techcrunch.com

ai airtel bharti airtel businesses +14

Women in AI: Rep. Dar’shun Kendrick wants to pass more AI legislation 21 hours ago | techcrunch.com

academics ai ai legislation artificial intelligence +14

Go on, let bots date other bots 1 day ago | techcrunch.com

ai apps black mirror bloomberg +17

U.K. agency releases tools to test AI model safety 1 day, 18 hours ago | techcrunch.com

academia agency ai ai model +13

At the AI Film Festival, humanity triumphed over tech 1 day, 20 hours ago | techcrunch.com

ai ai film festival 2024 ai tech animations +12

Motional cut about 550 employees, around 40%, in recent restructuring, sources say 2 days, 15 hours ago | techcrunch.com

autonomous autonomous vehicle commercial employees +14

OpenAI’s ChatGPT announcement: What we know so far 2 days, 16 hours ago | techcrunch.com

ai announcement chance chatgpt +11

Anthropic’s Claude sees tepid reception on iOS compared with ChatGPT’s debut 2 days, 18 hours ago | techcrunch.com

ai anthropic app apps +8

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

View on ai-jobs.net

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

View on ai-jobs.net

Lead Developer (AI)

@ Cere Network | San Francisco, US

View on ai-jobs.net

Research Engineer

@ Allora Labs | Remote

View on ai-jobs.net

Ecosystem Manager

@ Allora Labs | Remote

View on ai-jobs.net

Founding AI Engineer, Agents

@ Occam AI | New York

View on ai-jobs.net