March 16, 2024, 4:50 a.m.

Techmeme (www.techmeme.com)


Dan Goodin / Ars Technica:

Researchers detail ArtPrompt, a jailbreak that uses ASCII art to elicit harmful responses from aligned LLMs such as GPT-3.5, GPT-4, Gemini, Claude, and Llama2.

LLMs are trained to block harmful responses. Old-school images can override those rules.

Researchers have discovered …
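The mechanism is simple to sketch: take a request that safety training would normally refuse, mask the sensitive word, render that word as ASCII art, and instruct the model to decode the art and substitute it back before answering. The Python sketch below shows the general shape of such a prompt, assuming the pyfiglet library for ASCII-art rendering; the build_artprompt helper and the exact prompt wording are hypothetical illustrations, not the researchers' implementation, and the demo uses a benign word.

    import pyfiglet

    def build_artprompt(task_template: str, masked_word: str) -> str:
        # Hypothetical sketch of the general ArtPrompt shape, not the
        # paper's implementation: render the masked word as ASCII art
        # and ask the model to decode and substitute it before answering.
        art = pyfiglet.figlet_format(masked_word)
        return (
            "The ASCII art below spells a single word. Decode it, substitute "
            "it for [MASK] in the task, then answer the task.\n\n"
            + art + "\n"
            + "Task: " + task_template
        )

    # Benign demonstration of the same mechanism.
    print(build_artprompt("Write a short poem about [MASK].", "CLOUDS"))

The intuition reported in the article is that alignment keys on the literal text of harmful words, which the ASCII-art rendering never contains, so the model reconstructs the word only after safety checks have effectively been bypassed.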

