March 19, 2024, 4:41 a.m. | Xiaojun Xu, Yuanshun Yao, Yang Liu

cs.LG updates on arXiv.org

arXiv:2403.10553v1 Announce Type: new
Abstract: We study how to watermark LLM outputs, i.e., embedding algorithmically detectable signals into LLM-generated text to track misuse. Unlike current mainstream methods that work with a fixed LLM, we expand the watermark design space by including the LLM tuning stage in the watermark pipeline. While prior works focus on token-level watermarks that embed signals into the output, we design a model-level watermark that embeds signals into the LLM weights, and such signals can be …
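For context on the token-level watermarks the abstract contrasts against, below is a minimal sketch of a decode-time "green list" scheme in the style of Kirchenbauer et al. (2023). This is not the paper's model-level, weight-embedded approach; the toy vocabulary, hash seed, and generator here are illustrative assumptions only.

```python
# Sketch of a token-level watermark: at each step the previous token seeds a
# pseudorandom partition of the vocabulary into a "green list"; generation
# favors green tokens, and a detector counts green hits against chance.
import hashlib
import math
import random

VOCAB = [f"tok{i}" for i in range(1000)]  # assumed toy vocabulary
GAMMA = 0.5      # fraction of the vocabulary that is green at each step
SECRET_KEY = 42  # shared secret between generator and detector (assumption)

def green_list(prev_token: str) -> set:
    """Pseudorandomly partition the vocabulary, seeded by the previous token."""
    h = int(hashlib.sha256(f"{SECRET_KEY}:{prev_token}".encode()).hexdigest(), 16)
    rng = random.Random(h)
    return set(rng.sample(VOCAB, int(GAMMA * len(VOCAB))))

def generate_watermarked(length: int) -> list:
    """Toy generator that always picks a green token; a real LM would instead
    add a logit bias toward the green list before sampling."""
    text = ["tok0"]
    for _ in range(length):
        greens = green_list(text[-1])
        text.append(random.choice(sorted(greens)))
    return text

def detect(tokens: list) -> float:
    """z-score of how often tokens fall in their green list versus chance."""
    hits = sum(1 for prev, cur in zip(tokens, tokens[1:]) if cur in green_list(prev))
    n = len(tokens) - 1
    expected, std = GAMMA * n, math.sqrt(GAMMA * (1 - GAMMA) * n)
    return (hits - expected) / std

if __name__ == "__main__":
    watermarked = generate_watermarked(200)
    plain = ["tok0"] + [random.choice(VOCAB) for _ in range(200)]
    print("watermarked z =", round(detect(watermarked), 2))    # large positive
    print("unwatermarked z =", round(detect(plain), 2))        # near zero
```

The key limitation this illustrates is that the signal lives entirely in the decoding procedure of a fixed LLM; the paper's contribution is to move the signal into the model weights during tuning instead.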

Tags: arxiv, cs.ai, cs.cr, cs.lg, llm, reinforcement learning, text, watermark
