Automated Data Curation for Robust Language Model Fine-Tuning | allainews.com

March 20, 2024, 4:48 a.m. | Jiuhai Chen, Jonas Mueller

cs.CL updates on arXiv.org arxiv.org

arXiv:2403.12776v1 Announce Type: new
Abstract: Large Language Models have become the de facto approach to sequence-to-sequence text generation tasks, but for specialized tasks/domains, a pretrained LLM lacks specific capabilities to produce accurate or well-formatted responses. Supervised fine-tuning specializes a LLM by training it on dataset of example prompts with target responses, but real-world data tends to be noisy. While many fine-tuning algorithms exist, here we consider a \emph{data-centric AI} perspective on LLM fine-tuning, studying how to \emph{systematically} curate the training …

abstract arxiv automated become capabilities cs.cl curation data data curation dataset domains example fine-tuning language language model language models large language large language models llm model fine-tuning prompts responses robust supervised fine-tuning tasks text text generation training type

More from arxiv.org / cs.CL updates on arXiv.org

Multi-label Text Classification using GloVe and Neural Network Models 1 day, 6 hours ago | arxiv.org

abstract arxiv challenges classification +21

CFBenchmark: Chinese Financial Assistant Benchmark for Large Language Model 1 day, 6 hours ago | arxiv.org

arxiv assistant benchmark chinese +8

Leveraging text data for causal inference using electronic health records 1 day, 6 hours ago | arxiv.org

abstract arxiv causal causal inference +22

How do languages influence each other? Studying cross-lingual data sharing during LM fine-tuning 1 day, 6 hours ago | arxiv.org

abstract arxiv benefit cross-lingual +20

Unsupervised Multimodal Clustering for Semantics Discovery in Multimodal Utterances 1 day, 6 hours ago | arxiv.org

arxiv clustering cs.ai cs.cl +6

RecGPT: Generative Pre-training for Text-based Recommendation 1 day, 6 hours ago | arxiv.org

arxiv cs.cl cs.ir generative +5

From Human-to-Human to Human-to-Bot Conversations in Software Engineering 1 day, 6 hours ago | arxiv.org

abstract aim arxiv bot +21

ProtT3: Protein-to-Text Generation for Text-based Protein Understanding 1 day, 6 hours ago | arxiv.org

arxiv cs.cl cs.mm protein +5

CoCo Matrix: Taxonomy of Cognitive Contributions in Co-writing with Intelligent Agents 1 day, 6 hours ago | arxiv.org

abstract agents arxiv coco +14

Seeking Developers and Engineers for AI T-Shirt Generator Project

@ Chevon Hicks | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

View on ai-jobs.net

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

View on ai-jobs.net

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

View on ai-jobs.net

Senior Associate, Data and Analytics

@ Publicis Groupe | New York City, United States

View on ai-jobs.net