Feb. 6, 2024, 5:45 a.m. | Moschoula Pternea Prerna Singh Abir Chakraborty Yagna Oruganti Mirco Milletari Sayli Bapat Kebei Jiang

cs.LG updates on arXiv.org arxiv.org

In this work, we review research studies that combine Reinforcement Learning (RL) and Large Language Models (LLMs), two areas that owe their momentum to the development of deep neural networks. We propose a novel taxonomy of three main classes based on the way that the two model types interact with each other. The first class, RL4LLM, includes studies where RL is leveraged to improve the performance of LLMs on tasks related to Natural Language Processing. L4LLM is divided into two …

cs.ai cs.cl cs.lg cs.ro development language language models large language large language models llm llms networks neural networks novel reinforcement reinforcement learning research review studies taxonomy tree work

AI Research Scientist

@ Vara | Berlin, Germany and Remote

Data Architect

@ University of Texas at Austin | Austin, TX

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

AI Engineering Manager

@ M47 Labs | Barcelona, Catalunya [Cataluña], Spain