Donkii: Can Annotation Error Detection Methods Find Errors in Instruction-Tuning Datasets? | allainews.com

Feb. 23, 2024, 5:49 a.m. | Leon Weber-Genzel, Robert Litschko, Ekaterina Artemova, Barbara Plank

cs.CL updates on arXiv.org arxiv.org

arXiv:2309.01669v2 Announce Type: replace
Abstract: Instruction tuning has become an integral part of training pipelines for Large Language Models (LLMs) and has been shown to yield strong performance gains. In an orthogonal line of research, Annotation Error Detection (AED) has emerged as a tool for detecting quality problems in gold standard labels. So far, however, the application of AED methods has been limited to classification tasks. It is an open question how well AED methods generalize to language generation settings, …

abstract annotation arxiv become cs.cl datasets detection detection methods error errors integral language language models large language large language models line llms part performance pipelines quality research tool training type

More from arxiv.org / cs.CL updates on arXiv.org

Multimodal Learning Without Labeled Multimodal Data: Guarantees and Applications 3 hours ago | arxiv.org

abstract applications arxiv challenge +26

Unlearning Traces the Influential Training Data of Language Models 3 hours ago | arxiv.org

abstract arxiv cs.ai cs.cl +17

Axis Tour: Word Tour Determines the Order of Axes in ICA-transformed Embeddings 3 hours ago | arxiv.org

abstract analysis arxiv components +20

Japanese Tort-case Dataset for Rationale-supported Legal Judgment Prediction 3 hours ago | arxiv.org

abstract arxiv case court +14

MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI 3 hours ago | arxiv.org

abstract agi art arxiv +21

ConceptPsy:A Benchmark Suite with Conceptual Comprehensiveness in Psychology 3 hours ago | arxiv.org

abstract arxiv benchmark benchmarks +19

MC$^2$: Towards Transparent and Culturally-Aware NLP for Minority Languages in China 3 hours ago | arxiv.org

abstract accessibility arxiv challenge +19

Dodo: Dynamic Contextual Compression for Decoder-only LMs 3 hours ago | arxiv.org

abstract arxiv attention compression +23

Active Learning for Multilingual Fingerspelling Corpora 3 hours ago | arxiv.org

abstract active learning analysis apply +16

Senior Machine Learning Engineer

@ GPTZero | Toronto, Canada

View on ai-jobs.net

Head of Statistical Programming – US

@ Sobi | Waltham, MA, United States

View on ai-jobs.net

Data Lead Engineer

@ Capco | Brazil - Sao Paulo

View on ai-jobs.net

Intern Assistant Researcher - mmWave Imaging

@ Huawei Technologies Canada Co., Ltd. | Ottawa, Ontario, Canada

View on ai-jobs.net

Hardware Test Engineer, Amazon Robotics Hardware Test

@ Amazon.com | North Reading, Massachusetts, USA

View on ai-jobs.net

Mechanical Design Engineer (Aircraft Interiors)

@ Segula Technologies | Mexico City, Mexico

View on ai-jobs.net