Distributional Preference Learning: Understanding and Accounting for Hidden Context in RLHF | allainews.com

April 18, 2024, 4:43 a.m. | Anand Siththaranjan, Cassidy Laidlaw, Dylan Hadfield-Menell

stat.ML updates on arXiv.org arxiv.org

arXiv:2312.08358v2 Announce Type: replace-cross
Abstract: In practice, preference learning from human feedback depends on incomplete data with hidden context. Hidden context refers to data that affects the feedback received, but which is not represented in the data used to train a preference model. This captures common issues of data collection, such as having human annotators with varied preferences, cognitive processes that result in seemingly irrational behavior, and combining data labeled according to different criteria. We prove that standard applications of …

accounting arxiv context cs.ai cs.lg hidden rlhf stat.ml type understanding

More from arxiv.org / stat.ML updates on arXiv.org

A Strategy for Preparing Quantum Squeezed States Using Reinforcement Learning 19 hours ago | arxiv.org

abstract application arxiv collective +13

Robust Bayesian Inference for Berkson and Classical Measurement Error Models 19 hours ago | arxiv.org

abstract arxiv bayesian bayesian inference +11

Confidence Intervals for Error Rates in 1:1 Matching Tasks: Critical Statistical Analysis and Recommendations 19 hours ago | arxiv.org

abstract algorithm algorithms analysis +17

Statistical Inference for Linear Functionals of Online SGD in High-dimensional Linear Regression 19 hours ago | arxiv.org

abstract applications arxiv data +19

Convergence analysis of online algorithms for vector-valued kernel regression 19 hours ago | arxiv.org

abstract algorithm algorithms analysis +18

Tensor cumulants for statistical inference on invariant distributions 19 hours ago | arxiv.org

abstract arxiv canonical computational +18

Control randomisation approach for policy gradient and application to reinforcement learning in optimal switching 19 hours ago | arxiv.org

abstract application applications arxiv +14

Conformal Ranked Retrieval 19 hours ago | arxiv.org

abstract adoption arxiv control +16

Likelihood Based Inference in Fully and Partially Observed Exponential Family Graphical Models with Intractable Normalizing … 19 hours ago | arxiv.org

abstract arxiv building data +18

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

Tableau/PowerBI Developer (A.Con)

@ KPMG India | Bengaluru, Karnataka, India

View on ai-jobs.net

Software Engineer, Backend - Data Platform (Big Data Infra)

@ Benchling | San Francisco, CA

View on ai-jobs.net