April 16, 2024, 4:51 a.m. | Adi Simhi, Jonathan Herzig, Idan Szpektor, Yonatan Belinkov

cs.CL updates on arXiv.org

arXiv:2404.09971v1 Announce Type: new
Abstract: Large language models (LLMs) are susceptible to hallucinations, which has sparked widespread efforts to detect and prevent them. Recent work attempts to mitigate hallucinations by intervening in the model's computation during generation, using various setups and heuristics. However, these works do not separate the different causes of hallucination. In this work, we first introduce an approach for constructing datasets, based on the model's own knowledge, for detection and intervention methods in closed-book and open-book question-answering settings. We then characterize …
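One plausible reading of "constructing datasets based on the model's own knowledge" is to partition QA pairs by whether the model already answers them correctly, so that hallucinations on known facts can be studied separately from errors caused by missing knowledge. The sketch below illustrates that idea only; the model name, prompt format, and string-matching heuristic are illustrative assumptions, not the paper's actual pipeline.

```python
# Hypothetical sketch: split QA pairs by whether the model "knows" the
# answer, as one way to build knowledge-grounded datasets for studying
# hallucination detection and intervention. Not the authors' method.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # placeholder; the paper's models may differ
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

def model_knows(question: str, gold_answer: str) -> bool:
    """Greedy-decode a short answer and check it against the gold label."""
    prompt = f"Question: {question}\nAnswer:"
    inputs = tokenizer(prompt, return_tensors="pt")
    outputs = model.generate(
        **inputs,
        max_new_tokens=10,
        do_sample=False,  # greedy decoding: probe what the model itself prefers
        pad_token_id=tokenizer.eos_token_id,
    )
    # Keep only the newly generated tokens, then do a crude substring match.
    completion = tokenizer.decode(
        outputs[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True
    )
    return gold_answer.lower() in completion.lower()

# "known": errors here are hallucinations despite available knowledge;
# "unknown": errors may simply reflect missing knowledge.
qa_pairs = [("What is the capital of France?", "Paris")]
known = [qa for qa in qa_pairs if model_knows(*qa)]
unknown = [qa for qa in qa_pairs if not model_knows(*qa)]
```

Separating the two subsets lets detection and intervention methods be evaluated per cause rather than on a mixed pool, which is the distinction the abstract says prior work lacks.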

Tags: arxiv, benchmarks, cs.CL, hallucinations, LLMs
