April 2, 2024, 8:33 p.m. | Devin Coldewey

TechCrunch (techcrunch.com)

How do you get an AI to answer a question it’s not supposed to answer? There are many “jailbreak” techniques for doing this, and Anthropic researchers have just found a new one, in which a large language model can be convinced to tell you how to build a bomb if you first prime it with a few dozen less harmful questions […]
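A minimal sketch of the prompt structure described above: many benign question-and-answer exchanges concatenated ahead of a final target question. The Q&A pairs, function name, and prompt format here are illustrative placeholders only, not material from Anthropic's research or API.

```python
# Illustrative sketch of a "many-shot" style prompt: a long run of harmless
# Q&A exchanges placed before the final question. All content below is a
# hypothetical placeholder, not taken from the research described above.

BENIGN_QA_PAIRS = [
    ("How do I tie a bowline knot?", "Make a small loop, then pass the end through it and around the standing line."),
    ("What is the boiling point of water?", "100 degrees Celsius at sea level."),
    # ...in the described attack, a few dozen more harmless exchanges would follow...
]

def build_many_shot_prompt(target_question: str) -> str:
    """Concatenate many benign exchanges, then append the final question,
    mimicking the long-context priming the excerpt describes."""
    shots = "\n\n".join(
        f"Human: {question}\nAssistant: {answer}"
        for question, answer in BENIGN_QA_PAIRS
    )
    return f"{shots}\n\nHuman: {target_question}\nAssistant:"

if __name__ == "__main__":
    print(build_many_shot_prompt("<final question goes here>"))
```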


© 2024 TechCrunch. All rights reserved. For personal use only.

