WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models. (arXiv:2112.06598v2 [cs.CL] UPDATED)
Web: http://arxiv.org/abs/2112.06598
May 5, 2022, 1:11 a.m. | Benjamin Minixhofer, Fabian Paischer, Navid Rekabsaz
cs.CL updates on arXiv.org
Large pretrained language models (LMs) have become the central building block
of many NLP applications. Training these models requires ever more
computational resources, and most existing models are trained on English
text only. It is exceedingly expensive to train these models in other
languages. To alleviate this problem, we introduce a novel method -- called
WECHSEL -- to efficiently and effectively transfer pretrained LMs to new
languages. WECHSEL can be applied to any model that uses subword-based
tokenization …