all AI news
Fine-tune Llama 3 with ORPO
April 19, 2024, 2:40 p.m. | Maxime Labonne
Towards Data Science - Medium towardsdatascience.com
A cheaper and faster unified fine-tuning technique
Image generated with DALL-E 3 by authorORPO is a new exciting fine-tuning technique that combines the traditional supervised fine-tuning and preference alignment stages into a single process. This reduces the computational resources and time required for training. Moreover, empirical results demonstrate that ORPO outperforms other alignment methods on various model sizes and benchmarks.
In this article, we will fine-tune the new Llama 3 8B model using ORPO with the TRL library. The …
artificial intelligence editors pick hands-on-tutorials large language models machine learning
More from towardsdatascience.com / Towards Data Science - Medium
Jobs in AI, ML, Big Data
Software Engineer for AI Training Data (School Specific)
@ G2i Inc | Remote
Software Engineer for AI Training Data (Python)
@ G2i Inc | Remote
Software Engineer for AI Training Data (Tier 2)
@ G2i Inc | Remote
Data Engineer
@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania
Artificial Intelligence – Bioinformatic Expert
@ University of Texas Medical Branch | Galveston, TX
Lead Developer (AI)
@ Cere Network | San Francisco, US