all AI news
Attacking LLM Watermarks by Exploiting Their Strengths
Feb. 27, 2024, 5:43 a.m. | Qi Pang, Shengyuan Hu, Wenting Zheng, Virginia Smith
cs.LG updates on arXiv.org arxiv.org
Abstract: Advances in generative models have made it possible for AI-generated text, code, and images to mirror human-generated content in many applications. Watermarking, a technique that aims to embed information in the output of a model to verify its source, is useful for mitigating misuse of such AI-generated content. However, existing watermarking schemes remain surprisingly susceptible to attack. In particular, we show that desirable properties shared by existing LLM watermarking systems such as quality preservation, robustness, …
abstract advances ai-generated content ai-generated text applications arxiv code cs.cl cs.cr cs.lg embed generated generative generative models human images information llm misuse text type verify watermarking watermarks
More from arxiv.org / cs.LG updates on arXiv.org
Jobs in AI, ML, Big Data
Data Engineer
@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania
Artificial Intelligence – Bioinformatic Expert
@ University of Texas Medical Branch | Galveston, TX
Lead Developer (AI)
@ Cere Network | San Francisco, US
Research Engineer
@ Allora Labs | Remote
Ecosystem Manager
@ Allora Labs | Remote
Founding AI Engineer, Agents
@ Occam AI | New York