March 28, 2024, 4:42 a.m. | Tarun Suresh, Shubham Ugare, Gagandeep Singh, Sasa Misailovic

cs.LG updates on arXiv.org arxiv.org

arXiv:2403.17983v1 Announce Type: cross
Abstract: We present the first study of the robustness of existing watermarking techniques on Python code generated by large language models. Although prior work has shown that watermarking can be robust for natural language, we show that these watermarks are easily removed from code by semantic-preserving transformations.
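
The abstract does not enumerate the transformations used, so below is a minimal sketch of one generic semantic-preserving transformation, consistent renaming of parameters and local variables, applied to a snippet of LLM-generated Python. The `VariableRenamer` class and the example snippet are illustrative assumptions, not the paper's implementation.

```python
# Sketch only: a semantic-preserving transformation (variable renaming)
# of the kind that could be applied to watermarked, LLM-generated code.
import ast
import builtins


class VariableRenamer(ast.NodeTransformer):
    """Consistently rename parameters and local variables to fresh names (v0, v1, ...)."""

    def __init__(self):
        self.mapping = {}

    def _fresh(self, name):
        if name not in self.mapping:
            self.mapping[name] = f"v{len(self.mapping)}"
        return self.mapping[name]

    def visit_arg(self, node):
        # Function parameters (ast.arg nodes) are renamed first, so later
        # references in the body reuse the same mapping.
        node.arg = self._fresh(node.arg)
        return node

    def visit_Name(self, node):
        # Rename plain variable references; leave builtins untouched so the
        # program's behavior is unchanged.
        if isinstance(node.ctx, (ast.Load, ast.Store)) and node.id not in dir(builtins):
            node.id = self._fresh(node.id)
        return node


# Hypothetical watermarked output from a code LLM.
generated = "def add(a, b):\n    total = a + b\n    return total\n"

tree = ast.parse(generated)
transformed = ast.unparse(VariableRenamer().visit(tree))
print(transformed)  # same semantics, different surface tokens
```

Whether a rename like this suffices to degrade a given detector depends on how the watermark is embedded in the token stream; the point of the sketch is only that such rewrites change the code's surface form while leaving its behavior intact.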

