Sept. 16, 2022, 1:16 a.m. | Yue Yu, Rongzhi Zhang, Ran Xu, Jieyu Zhang, Jiaming Shen, Chao Zhang

cs.CL updates on arXiv.org

We propose PATRON, a new method that uses prompt-based uncertainty estimation
to select data for pre-trained language model fine-tuning under cold-start
scenarios, i.e., when no initial labeled data are available. In PATRON, we design
(1) a prompt-based uncertainty propagation approach to estimate the importance
of data points and (2) a partition-then-rewrite (PTR) strategy to promote
sample diversity when querying for annotations. Experiments on six text
classification datasets show that PATRON outperforms the strongest cold-start
data selection baselines by up to 6.9%. …
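The two ingredients above, an uncertainty score per example and a partition step that keeps the labeled set diverse, can be illustrated with a minimal sketch. This is not the PATRON algorithm itself (it omits prompt-based scoring and the actual partition-then-rewrite strategy); it is a toy "partition, then pick the most uncertain point per partition" selector, with a plain k-means partition and predictive entropy as the assumed uncertainty measure. All names (`select_cold_start`, `entropy`, the toy pool) are illustrative.

```python
import math
import random

def entropy(probs):
    """Predictive entropy of a class distribution: higher = more uncertain."""
    return -sum(p * math.log(p) for p in probs if p > 0)

def squared_dist(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))

def mean_vec(vecs):
    n = len(vecs)
    return tuple(sum(col) / n for col in zip(*vecs))

def select_cold_start(embeddings, prob_dists, budget, seed=0, iters=10):
    """Toy cold-start selection: partition the unlabeled pool into `budget`
    clusters with a tiny k-means, then query the most uncertain point in
    each cluster, so the selected batch stays diverse."""
    rng = random.Random(seed)
    centers = rng.sample(embeddings, budget)
    clusters = [[] for _ in range(budget)]
    for _ in range(iters):
        clusters = [[] for _ in range(budget)]
        for i, x in enumerate(embeddings):
            j = min(range(budget), key=lambda c: squared_dist(x, centers[c]))
            clusters[j].append(i)
        # Recompute centers; keep the old center if a cluster went empty.
        centers = [mean_vec([embeddings[i] for i in cl]) if cl else centers[c]
                   for c, cl in enumerate(clusters)]
    return [max(cl, key=lambda i: entropy(prob_dists[i])) for cl in clusters if cl]

# Toy pool: two well-separated 2-D groups with made-up class probabilities.
pool = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (5.0, 5.0), (5.1, 5.0), (5.0, 5.1)]
probs = [(0.9, 0.1), (0.5, 0.5), (0.8, 0.2), (0.6, 0.4), (0.55, 0.45), (0.95, 0.05)]
picked = select_cold_start(pool, probs, budget=2)
print(picked)  # one index per cluster, the most uncertain point in each
```

The diversity constraint is what distinguishes this family of methods from pure uncertainty sampling: without the partition step, the highest-entropy points often come from the same region of embedding space, wasting annotation budget on near-duplicates.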
