Tailoring Self-Rationalizers with Multi-Reward Distillation
May 24, 2024, 4:55 a.m. | Sahana Ramnath, Brihi Joshi, Skyler Hallinan, Ximing Lu, Liunian Harold Li, Aaron Chan, Jack Hessel, Yejin Choi, Xiang Ren
cs.CL updates on arXiv.org
Abstract: Large language models (LMs) are capable of generating free-text rationales to aid question answering. However, prior work 1) suggests that useful self-rationalization is emergent only at significant scales (e.g., 175B parameter GPT-3); and 2) focuses largely on downstream performance, ignoring the semantics of the rationales themselves, e.g., are they faithful, true, and helpful for humans? In this work, we enable small-scale LMs (approx. 200x smaller than GPT-3) to generate rationales that not only improve downstream …
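The abstract describes training small LMs with multiple reward signals over their generated rationales. As a generic, hedged illustration of that idea (not the paper's exact algorithm), the sketch below combines several hypothetical rationale-quality rewards into one scalar and uses it to weight a per-example distillation loss, so examples with better rationales contribute more to the update. The reward names and weighting scheme are illustrative assumptions.

```python
# Minimal sketch of multi-reward-weighted distillation.
# Reward names ("plausibility", "consistency") are illustrative
# assumptions, not the paper's actual reward set.

def combined_reward(rewards: dict, weights: dict) -> float:
    """Aggregate several rationale-quality rewards into one scalar."""
    return sum(weights[name] * score for name, score in rewards.items())

def weighted_distillation_loss(per_example_loss: list,
                               per_example_rewards: list,
                               weights: dict) -> float:
    """Scale each example's distillation loss by its combined reward,
    so high-reward rationales dominate the training signal."""
    total = 0.0
    for loss, rewards in zip(per_example_loss, per_example_rewards):
        total += combined_reward(rewards, weights) * loss
    return total / len(per_example_loss)
```

For instance, with equal weights of 0.5 on two rewards, an example scoring 1.0 on plausibility and 0.5 on consistency receives a combined reward of 0.75, down-weighting its loss relative to a perfectly rated rationale.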