Multilingual Jailbreak Challenges in Large Language Models | allainews.com

Feb. 29, 2024, 5:48 a.m. | Yue Deng, Wenxuan Zhang, Sinno Jialin Pan, Lidong Bing

cs.CL updates on arXiv.org arxiv.org

arXiv:2310.06474v2 Announce Type: replace
Abstract: While large language models (LLMs) exhibit remarkable capabilities across a wide range of tasks, they pose potential safety concerns, such as the ``jailbreak'' problem, wherein malicious instructions can manipulate LLMs to exhibit undesirable behavior. Although several preventive measures have been developed to mitigate the potential risks associated with LLMs, they have primarily focused on English. In this study, we reveal the presence of multilingual jailbreak challenges within LLMs and consider two potential risky scenarios: unintentional …

arxiv challenges cs.cl jailbreak language language models large language large language models multilingual type

More from arxiv.org / cs.CL updates on arXiv.org

ChatDev: Communicative Agents for Software Development 22 hours ago | arxiv.org

agents arxiv chatdev communicative agents +8

Right to be Forgotten in the Era of Large Language Models: Implications, Challenges, and Solutions 22 hours ago | arxiv.org

abstract arxiv challenges cs.ai +18

JumpCoder: Go Beyond Autoregressive Coder via Online Modification 22 hours ago | arxiv.org

arxiv autoregressive beyond coder +6

Building Efficient and Effective OpenQA Systems for Low-Resource Languages 22 hours ago | arxiv.org

arxiv building cs.cl languages +4

WaveCoder: Widespread And Versatile Enhancement For Code Large Language Models By Instruction Tuning 22 hours ago | arxiv.org

abstract arxiv capabilities code +18

Simul-LLM: A Framework for Exploring High-Quality Simultaneous Translation with Large Language Models 22 hours ago | arxiv.org

abstract art arxiv cs.ai +27

Uncertainty Estimation on Sequential Labeling via Uncertainty Transmission 22 hours ago | arxiv.org

arxiv cs.cl labeling replace +3

FollowBench: A Multi-level Fine-grained Constraints Following Benchmark for Large Language Models 22 hours ago | arxiv.org

arxiv benchmark constraints cs.cl +7

PartialFormer: Modeling Part Instead of Whole for Machine Translation 22 hours ago | arxiv.org

arxiv cs.ai cs.cl machine +6

Senior Machine Learning Engineer

@ GPTZero | Toronto, Canada

View on ai-jobs.net

ML/AI Engineer / NLP Expert - Custom LLM Development (x/f/m)

@ HelloBetter | Remote

View on ai-jobs.net

Doctoral Researcher (m/f/div) in Automated Processing of Bioimages

@ Leibniz Institute for Natural Product Research and Infection Biology (Leibniz-HKI) | Jena

View on ai-jobs.net

Director, Global Success Business Intelligence

@ Salesforce | Texas - Austin

View on ai-jobs.net

Deep Learning Compiler Engineer - MLIR

@ NVIDIA | US, CA, Santa Clara

View on ai-jobs.net

Commerce Data Engineer (Remote)

@ CrowdStrike | USA TX Remote

View on ai-jobs.net