Jan. 31, 2024, 3:41 p.m. | Bolei Ma, Ercong Nie, Shuzhou Yuan, Helmut Schmid, Michael Färber, Frauke Kreuter, Hinrich Schütze

cs.CL updates on arXiv.org arxiv.org

Prompt-based methods have been successfully applied to multilingual pretrained language models for zero-shot cross-lingual understanding. However, most previous studies have focused on sentence-level classification tasks, and only a few have considered token-level labeling tasks such as Named Entity Recognition (NER) and Part-of-Speech (POS) tagging. In this paper, we propose Token-Level Prompt Decomposition (ToPro), which enables prompt-based methods for token-level sequence labeling tasks. The ToPro method decomposes an input sentence into single tokens and applies one prompt template to each token. …
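
The decomposition step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the template wording, the `<mask>` placeholder, the whitespace tokenization, and the function name `decompose` are all assumptions made for illustration, since the abstract does not specify them.

```python
from typing import List

# Hypothetical template: the abstract does not give the actual prompt wording.
TEMPLATE = 'Sentence: {sentence} The label of the word "{token}" is {mask}.'


def decompose(sentence: str, mask_token: str = "<mask>") -> List[str]:
    """Build one prompt per token of the input sentence.

    Tokens are obtained by whitespace splitting, which is an assumption;
    a real system would use the tokenization of its labeled dataset.
    """
    tokens = sentence.split()
    return [
        TEMPLATE.format(sentence=sentence, token=token, mask=mask_token)
        for token in tokens
    ]


if __name__ == "__main__":
    # Each printed prompt would be scored separately by a language model,
    # which predicts a label for the single token named in the prompt.
    for prompt in decompose("Paris is beautiful"):
        print(prompt)
```

Under these assumptions, a sentence with n tokens yields n prompts, each containing the full sentence as context and naming exactly one token to be labeled.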

