July 14, 2022, 1:12 a.m. | Wenbiao Li, Rui Sun, Yunfang Wu

cs.CL updates on arXiv.org

Most Chinese pre-trained models adopt characters as the basic units for
downstream tasks. However, these models ignore the information carried by words
and thus lose some important semantics. In this paper, we propose a new method
to exploit word structure and integrate lexical semantics into the character
representations of pre-trained models. Specifically, we project a word's
embedding onto its internal characters' embeddings according to similarity
weights. To strengthen the word boundary information, we mix the …
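The projection step described in the excerpt can be sketched as follows. This is a minimal illustration only: cosine similarity with a softmax over the word's internal characters is an assumed weighting scheme, since the truncated abstract does not specify the exact similarity function, and `inject_word_semantics` is a hypothetical helper name.

```python
import torch
import torch.nn.functional as F

def inject_word_semantics(char_embs: torch.Tensor, word_emb: torch.Tensor) -> torch.Tensor:
    """Mix a word's embedding into its internal characters' embeddings.

    char_embs: (num_chars, dim) embeddings of the characters inside the word
    word_emb:  (dim,) embedding of the whole word

    Sketch of the similarity-weighted projection described in the abstract;
    the cosine/softmax weighting here is an assumption, not the paper's
    confirmed formulation.
    """
    # Similarity of each internal character to the whole word.
    sims = F.cosine_similarity(char_embs, word_emb.unsqueeze(0), dim=-1)  # (num_chars,)
    weights = torch.softmax(sims, dim=-1)                                 # (num_chars,)
    # Distribute the word embedding over its characters by weight and
    # add it to the original character representations.
    return char_embs + weights.unsqueeze(-1) * word_emb.unsqueeze(0)

# Toy usage: a two-character word with 8-dimensional embeddings.
chars = torch.randn(2, 8)
word = torch.randn(8)
enriched = inject_word_semantics(chars, word)
print(enriched.shape)  # torch.Size([2, 8])
```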
