Building RAG-based LLM Applications for Production // Philipp Moritz & Yifei Feng // LLMs III Talk | allainews.com

Nov. 6, 2023, 12:31 a.m. | MLOps.community

MLOps.community www.youtube.com

// Abstract
In this talk, we will cover how to develop and deploy RAG-based LLM applications for production. We will cover how the major workloads (data loading and preprocessing, embedding, serving) can be scaled on a cluster, how different configurations can be evaluated and how the application can be deployed. We will also give an introduction to Anyscale Endpoints which offers a cost-effective solution for serving popular open-source models.

// Bio
Philipp Moritz
Philipp Moritz is one of the creators …

abstract applications building cluster data data loading deploy embedding iii llm llm applications llms loading major production rag talk workloads

More from www.youtube.com / MLOps.community

Handling Multi-Terabyte LLM Checkpoints // Simon Karasik // MLOps Podcast #228 19 hours ago | www.youtube.com

abstract big cloud cloud storage +15

Leading Enterprise Data Teams // Sol Rashidi // MLOps Podcast #227 4 days, 16 hours ago | www.youtube.com

abstract building cases ceo +20

The Changing Face of AI Engineering // Amritha Arun Babu & Abhik Choudhury // Podcast … 5 days, 20 hours ago | www.youtube.com

ai engineer analytics build cases +18

Building Conversational AI Agents with Voice // Michelle Chan // AI in Production Conference 6 days ago | www.youtube.com

abstract agents ai agents baseten +21

Reliable Hallucination Detection in Large Language Models // Jiaxin Zhang // AI in Production Talk 6 days ago | www.youtube.com

abstract detection hallucination hallucinations +11

Fostering Connections and Careers with MLOps Community // Demetrios Brinkmann // Podcast #220 clip 6 days, 16 hours ago | www.youtube.com

building community discuss founder +9

Shipping LLMs: Buckle Up & Enjoy the Ride // Rex Harris // AI in Production … 1 week ago | www.youtube.com

abstract adventure buckle up challenges +18

Accelerate ML Production with Agents // Salma Mayorquin // AI in Production Conference 1 week ago | www.youtube.com

abstract abstraction agents challenges +16

DSPy Assertions: Computational Constraints for Self-Refining LM Pipelines // Arnav Singhvi // Talk 1 week ago | www.youtube.com

abstract challenge computational constraints +13

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net

Senior Machine Learning Engineer (MLOps)

@ Promaton | Remote, Europe

View on ai-jobs.net

MLOps Engineer - Hybrid Intelligence

@ Capgemini | Madrid, M, ES

View on ai-jobs.net

Analista de Business Intelligence (Industry Insights)

@ NielsenIQ | Cotia, Brazil

View on ai-jobs.net