Feb. 28, 2024, 5:49 a.m. | Duarte M. Alves, José Pombal, Nuno M. Guerreiro, Pedro H. Martins, João Alves, Amin Farajian, Ben Peters, Ricardo Rei, Patrick Fernandes, Sweta Ag

cs.CL updates on arXiv.org

arXiv:2402.17733v1 Announce Type: new
Abstract: While general-purpose large language models (LLMs) demonstrate proficiency on multiple tasks within the domain of translation, approaches based on open LLMs are competitive only when specializing on a single task. In this paper, we propose a recipe for tailoring LLMs to multiple tasks present in translation workflows. We perform continued pretraining on a multilingual mixture of monolingual and parallel data, creating TowerBase, followed by finetuning on instructions relevant for translation processes, creating TowerInstruct. Our final …
