s
April 16, 2024, 9:28 p.m. |

Simon Willison's Weblog simonwillison.net

Google NotebookLM Data Exfiltration


NotebookLM is a Google Labs product that lets you store information as sources (mainly text files in PDF) and then ask questions against those sources - effectively an interface for building your own custom RAG (Retrieval Augmented Generation) chatbots.


Unsurprisingly for anything that allows LLMs to interact with untrusted documents, it's susceptible to prompt injection.


Johann Rehberger found some classic prompt injection exfiltration attacks: you can create source documents with instructions that cause the chatbot to …

ai building chatbots data documents files generativeai google google notebooklm information labs llms notebooklm pdf product promptinjection questions rag retrieval retrieval augmented generation security store text

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

Business Intelligence Manager

@ Sanofi | Budapest

Principal Engineer, Data (Hybrid)

@ Homebase | Toronto, Ontario, Canada