all AI news
Tower: An Open Multilingual Large Language Model for Translation-Related Tasks
Feb. 28, 2024, 5:49 a.m. | Duarte M. Alves, Jos\'e Pombal, Nuno M. Guerreiro, Pedro H. Martins, Jo\~ao Alves, Amin Farajian, Ben Peters, Ricardo Rei, Patrick Fernandes, Sweta Ag
cs.CL updates on arXiv.org arxiv.org
Abstract: While general-purpose large language models (LLMs) demonstrate proficiency on multiple tasks within the domain of translation, approaches based on open LLMs are competitive only when specializing on a single task. In this paper, we propose a recipe for tailoring LLMs to multiple tasks present in translation workflows. We perform continued pretraining on a multilingual mixture of monolingual and parallel data, creating TowerBase, followed by finetuning on instructions relevant for translation processes, creating TowerInstruct. Our final …
abstract arxiv cs.cl domain general language language model language models large language large language model large language models llms multilingual multiple paper recipe tasks translation type
More from arxiv.org / cs.CL updates on arXiv.org
Jobs in AI, ML, Big Data
Senior Machine Learning Engineer
@ GPTZero | Toronto, Canada
ML/AI Engineer / NLP Expert - Custom LLM Development (x/f/m)
@ HelloBetter | Remote
Doctoral Researcher (m/f/div) in Automated Processing of Bioimages
@ Leibniz Institute for Natural Product Research and Infection Biology (Leibniz-HKI) | Jena
Seeking Developers and Engineers for AI T-Shirt Generator Project
@ Chevon Hicks | Remote
Stage-Automne – Intelligence d’affaires pour l’après-marché connecté /Internship-Fall-Connected Aftermarket Business Intelligence
@ RTX | LOC13052 1000 Boul Marie Victorin,Longueuil,Quebec,J4G 1A1,Canada
Business Intelligence Analyst Health Plan Operations
@ Corewell Health | SITE - Priority Health - 1239 E Beltline - Grand Rapids