s
March 29, 2024, 2:40 p.m. |

Simon Willison's Weblog simonwillison.net

SEP Dataset: Should it be Separated or Processed?


Released in conjunction with a new paper about prompt injection: Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?


Spoiler: we can't.


This dataset includes 9,160 test cases. They look like this (slightly simplified):


System prompt: Update the following text with modern language and contexts while retaining the original tone.


Prompt: It is a truth universally acknowledged, that a single man in possession of a good fortune, …

ai cases data dataset generativeai llms look mean paper prompt prompt injection promptinjection security simplified test update

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Senior Data Engineer

@ Quantexa | Sydney, New South Wales, Australia

Staff Analytics Engineer

@ Warner Bros. Discovery | NY New York 230 Park Avenue South