s
March 29, 2024, 2:40 p.m. |

Simon Willison's Weblog simonwillison.net

SEP Dataset: Should it be Separated or Processed?


Released in conjunction with a new paper about prompt injection: Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?


Spoiler: we can't.


This dataset includes 9,160 test cases. They look like this (slightly simplified):


System prompt: Update the following text with modern language and contexts while retaining the original tone.


Prompt: It is a truth universally acknowledged, that a single man in possession of a good fortune, …

ai cases data dataset generativeai llms look mean paper prompt prompt injection promptinjection security simplified test update

Data Scientist (m/f/x/d)

@ Symanto Research GmbH & Co. KG | Spain, Germany

Data Analyst

@ S&P Global | IN - HYDERABAD SKYVIEW

EY GDS Internship Program - Junior Data Visualization Engineer (June - July 2024)

@ EY | Wrocław, DS, PL, 50-086

Staff Data Scientist

@ ServiceTitan | INT Armenia Yerevan

Master thesis on deterministic AI inference on-board Telecom Satellites

@ Airbus | Taufkirchen / Ottobrunn

Lead Data Scientist

@ Picket | Seattle, WA