May 7, 2024, 4:50 a.m. | Yuval Reif, Roy Schwartz

cs.CL updates on arXiv.org arxiv.org

arXiv:2405.02743v1 Announce Type: new
Abstract: Large language models (LLMs) have shown remarkable adaptability to diverse tasks, by leveraging context prompts containing instructions, or minimal input-output examples. However, recent work revealed they also exhibit label bias -- an undesirable preference toward predicting certain answers over others. Still, detecting and measuring this bias reliably and at scale has remained relatively unexplored. In this study, we evaluate different approaches to quantifying label bias in a model's predictions, conducting a comprehensive investigation across 279 …

abstract adaptability arxiv beyond bias context cs.cl diverse examples however input-output language language models large language large language models llms measuring performance prompts tasks type work

ML/AI Engineer / NLP Expert - Custom LLM Development (x/f/m)

@ HelloBetter | Remote

Doctoral Researcher (m/f/div) in Automated Processing of Bioimages

@ Leibniz Institute for Natural Product Research and Infection Biology (Leibniz-HKI) | Jena

Seeking Developers and Engineers for AI T-Shirt Generator Project

@ Chevon Hicks | Remote

Cloud Data Platform Engineer

@ First Central | Home Office (Remote)

Associate Director, Data Science

@ MSD | USA - New Jersey - Rahway

Data Scientist Sr.

@ MSD | CHL - Santiago - Santiago (Calle Mariano)