Distributional Preference Learning: Understanding and Accounting for Hidden Context in RLHF
April 18, 2024, 4:43 a.m. | Anand Siththaranjan, Cassidy Laidlaw, Dylan Hadfield-Menell
stat.ML updates on arXiv.org
Abstract: In practice, preference learning from human feedback depends on incomplete data with hidden context. Hidden context refers to data that affects the feedback received, but which is not represented in the data used to train a preference model. This captures common issues of data collection, such as having human annotators with varied preferences, cognitive processes that result in seemingly irrational behavior, and combining data labeled according to different criteria. We prove that standard applications of …
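The hidden-context problem described above can be illustrated with a toy sketch (my own hypothetical setup, not the paper's experiments): when annotators with opposing preferences label the same comparison, a standard Bradley-Terry preference model marginalizes over annotator identity and learns only the aggregate preference probability, erasing the disagreement.

```python
import numpy as np

# Hypothetical illustration: 70% of annotators prefer item A over B,
# 30% prefer B. Annotator identity is hidden context: it determines
# the label but is absent from the model's inputs.
rng = np.random.default_rng(0)
labels = rng.random(10_000) < 0.7  # True = "A preferred over B"

# Standard Bradley-Terry fit: scalar rewards r_A, r_B chosen to
# maximize the likelihood sigmoid(r_A - r_B) of the observed labels.
r = np.zeros(2)  # [r_A, r_B]
lr = 0.1
for _ in range(2000):
    p = 1.0 / (1.0 + np.exp(-(r[0] - r[1])))  # model's P(A > B)
    grad = labels.mean() - p                  # gradient of mean log-likelihood
    r[0] += lr * grad
    r[1] -= lr * grad

p_hat = 1.0 / (1.0 + np.exp(-(r[0] - r[1])))
# p_hat converges to labels.mean() (~0.70): the two annotator groups
# collapse into one aggregate score, and the fact that 30% of
# annotators strictly disagree is no longer recoverable.
```

A distributional approach, as the title suggests, would instead model a distribution over rewards for each item, so that bimodal annotator preferences remain visible rather than being averaged away.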