Tailoring Self-Rationalizers with Multi-Reward Distillation
May 24, 2024, 4:55 a.m. | Sahana Ramnath, Brihi Joshi, Skyler Hallinan, Ximing Lu, Liunian Harold Li, Aaron Chan, Jack Hessel, Yejin Choi, Xiang Ren
cs.CL updates on arXiv.org (arxiv.org)
Abstract: Large language models (LMs) are capable of generating free-text rationales to aid question answering. However, prior work 1) suggests that useful self-rationalization is emergent only at significant scales (e.g., 175B parameter GPT-3); and 2) focuses largely on downstream performance, ignoring the semantics of the rationales themselves, e.g., are they faithful, true, and helpful for humans? In this work, we enable small-scale LMs (approx. 200x smaller than GPT-3) to generate rationales that not only improve downstream …
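The title's "multi-reward distillation" suggests combining several rationale-quality signals (e.g., plausibility, consistency) into a single score used to steer fine-tuning. The paper's actual algorithm is not given in the excerpt above, so the following is only a minimal, hypothetical sketch: all function names, reward names, weights, and the threshold-filtering step are illustrative assumptions, not the authors' implementation.

```python
# Hypothetical sketch of multi-reward scoring for candidate rationales.
# Reward names ("plausibility", "consistency"), weights, and the filtering
# strategy are illustrative assumptions, not the paper's actual method.

def combined_reward(rewards: dict, weights: dict) -> float:
    """Weighted sum of per-property rewards for one candidate rationale."""
    return sum(weights[name] * rewards[name] for name in weights)

def select_rationales(candidates, weights, threshold=0.5):
    """Keep candidates whose combined reward clears a threshold.

    candidates: list of (rationale_text, reward_dict) pairs.
    Returns (rationale_text, combined_score) pairs for the survivors,
    which could then serve as fine-tuning targets for a small LM.
    """
    return [
        (text, combined_reward(rewards, weights))
        for text, rewards in candidates
        if combined_reward(rewards, weights) >= threshold
    ]

# Toy usage: one low-quality and one high-quality candidate rationale.
candidates = [
    ("Because birds can fly, penguins can fly.",
     {"plausibility": 0.2, "consistency": 0.4}),
    ("Penguins are birds, but they are flightless.",
     {"plausibility": 0.9, "consistency": 0.8}),
]
weights = {"plausibility": 0.5, "consistency": 0.5}
kept = select_rationales(candidates, weights)
```

Under these toy numbers, only the second rationale (combined score 0.85) survives the 0.5 threshold; the first scores 0.3 and is dropped.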