Feb. 6, 2024, 5:53 a.m. | Yi Dong, Ronghui Mu, Gaojie Jin, Yi Qi, Jinwei Hu, Xingyu Zhao, Jie Meng, Wenjie Ruan, Xiaowei Huang

cs.CL updates on arXiv.org

As Large Language Models (LLMs) become more integrated into our daily lives, it is crucial to identify and mitigate their risks, especially when the risks can have profound impacts on human users and societies. Guardrails, which filter the inputs or outputs of LLMs, have emerged as a core safeguarding technology. This position paper takes a deep look at current open-source solutions (Llama Guard, Nvidia NeMo, Guardrails AI), and discusses the challenges and the road towards building more complete solutions. Drawing …
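To make the filtering idea in the abstract concrete, here is a minimal sketch of a guardrail that wraps an LLM call with an input check and an output check. This is not the API of Llama Guard, NVIDIA NeMo, or Guardrails AI; `llm_generate`, `violates_policy`, and `BLOCKED_TERMS` are hypothetical placeholders for a real model call and a real policy classifier.

```python
from typing import Callable

# Hypothetical stand-in for a real policy classifier (e.g. a trained
# moderation model); a keyword list is used only to keep the sketch runnable.
BLOCKED_TERMS = {"build a bomb", "credit card dump"}


def violates_policy(text: str) -> bool:
    """Toy policy check: flag text containing any blocked phrase."""
    lowered = text.lower()
    return any(term in lowered for term in BLOCKED_TERMS)


def guarded_generate(prompt: str, llm_generate: Callable[[str], str]) -> str:
    """Wrap an LLM call with an input guardrail and an output guardrail."""
    # Input guardrail: refuse before the model ever sees the prompt.
    if violates_policy(prompt):
        return "Sorry, I can't help with that request."

    response = llm_generate(prompt)

    # Output guardrail: filter the model's response before returning it.
    if violates_policy(response):
        return "The generated response was withheld by the output filter."

    return response


if __name__ == "__main__":
    # A trivial echo "model" just to make the sketch runnable end to end.
    print(guarded_generate("Summarise today's AI news.", lambda p: f"Echo: {p}"))
```

Production guardrail toolkits replace both placeholder checks with learned classifiers or rule engines, but the wrapping pattern around the model call is the same.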

Tags: cs.AI, cs.CL, guardrails, large language models (LLMs)
