Beyond Prompting: Making Pre-trained Language Models Better Zero-shot Learners by Clustering Representations. (arXiv:2210.16637v2 [cs.CL] UPDATED) | allainews.com

Nov. 24, 2022, 7:18 a.m. | Yu Fei, Ping Nie, Zhao Meng, Roger Wattenhofer, Mrinmaya Sachan

cs.CL updates on arXiv.org arxiv.org

Recent work has demonstrated that pre-trained language models (PLMs) are
zero-shot learners. However, most existing zero-shot methods involve heavy
human engineering or complicated self-training pipelines, hindering their
application to new situations. In this work, we show that zero-shot text
classification can be improved simply by clustering texts in the embedding
spaces of PLMs. Specifically, we fit the unlabeled texts with a Bayesian
Gaussian Mixture Model after initializing cluster positions and shapes using
class names. Despite its simplicity, this approach achieves …

arxiv clustering language language models making

More from arxiv.org / cs.CL updates on arXiv.org

Drop your Decoder: Pre-training with Bag-of-Word Prediction for Dense Passage Retrieval 10 hours ago | arxiv.org

abstract arxiv auto bag +17

Does GPT-4 pass the Turing test? 10 hours ago | arxiv.org

abstract arxiv cs.ai cs.cl +16

Carpe Diem: On the Evaluation of World Knowledge in Lifelong Language Models 10 hours ago | arxiv.org

abstract arxiv challenges cs.cl +13

COPAL-ID: Indonesian Language Reasoning with Local Culture and Nuances 10 hours ago | arxiv.org

abstract arxiv causal common sense +11

Empirical study of pretrained multilingual language models for zero-shot cross-lingual knowledge transfer in generation 10 hours ago | arxiv.org

abstract arxiv cross-lingual cs.cl +17

SemStamp: A Semantic Watermark with Paraphrastic Robustness for Text Generation 10 hours ago | arxiv.org

abstract algorithm algorithms arxiv +19

C-Pack: Packaged Resources To Advance General Chinese Embedding 10 hours ago | arxiv.org

advance arxiv chinese cs.ai +6

$\rm SP^3$: Enhancing Structured Pruning via PCA Projection 10 hours ago | arxiv.org

abstract arxiv cs.ai cs.cl +12

Matching Patients to Clinical Trials with Large Language Models 10 hours ago | arxiv.org

abstract arxiv challenge clinical +19

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

[Job - 14823] Senior Data Scientist (Data Analyst Sr)

@ CI&T | Brazil

View on ai-jobs.net

Data Engineer

@ WorldQuant | Hanoi

View on ai-jobs.net

ML Engineer / Toronto

@ Intersog | Toronto, Ontario, Canada

View on ai-jobs.net

Analista de Business Intelligence (Industry Insights)

@ NielsenIQ | Cotia, Brazil

View on ai-jobs.net