Oct. 12, 2022, 1:17 a.m. | Sosuke Nishikawa, Ikuya Yamada, Yoshimasa Tsuruoka, Isao Echizen

cs.CL updates on arXiv.org arxiv.org

We present a multilingual bag-of-entities model that effectively boosts the
performance of zero-shot cross-lingual text classification by extending a
multilingual pre-trained language model (e.g., M-BERT). It leverages the
multilingual nature of Wikidata: entities that represent the same concept
across languages share a single unique identifier. This allows entities
described in different languages to be represented with shared embeddings,
so a model trained on entity features in a resource-rich language can be
applied directly to other languages. Our experimental results …
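The core idea can be illustrated with a minimal sketch (hypothetical data and embedding table, not the authors' implementation): entity mentions in any language are resolved to language-independent Wikidata QIDs, so documents in different languages that mention the same concepts produce identical bag-of-entities features.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical shared embedding table keyed by Wikidata QID.
# In the paper's setting these would be learned entity embeddings.
entity_emb = {
    "Q183": rng.normal(size=4),   # Germany
    "Q7397": rng.normal(size=4),  # software
    "Q30": rng.normal(size=4),    # United States
}

def bag_of_entities(qids, table, dim=4):
    """Average the shared embeddings of the entities detected in a document."""
    vecs = [table[q] for q in qids if q in table]
    if not vecs:
        return np.zeros(dim)
    return np.mean(vecs, axis=0)

# An English and a German document mentioning the same concepts resolve
# to the same QIDs, hence to the same feature vector.
en_doc = ["Q183", "Q7397"]  # "Germany ... software"
de_doc = ["Q183", "Q7397"]  # "Deutschland ... Software"

assert np.allclose(bag_of_entities(en_doc, entity_emb),
                   bag_of_entities(de_doc, entity_emb))
```

Because the feature space is shared across languages, a classifier trained on these vectors in one language transfers to the others with no additional training, which is the zero-shot cross-lingual setting the abstract describes.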
