all AI news
Building RAG-based LLM Applications for Production // Philipp Moritz & Yifei Feng // LLMs III Talk
Nov. 6, 2023, 12:31 a.m. | MLOps.community
MLOps.community www.youtube.com
In this talk, we will cover how to develop and deploy RAG-based LLM applications for production. We will cover how the major workloads (data loading and preprocessing, embedding, serving) can be scaled on a cluster, how different configurations can be evaluated and how the application can be deployed. We will also give an introduction to Anyscale Endpoints which offers a cost-effective solution for serving popular open-source models.
// Bio
Philipp Moritz
Philipp Moritz is one of the creators …
abstract applications building cluster data data loading deploy embedding iii llm llm applications llms loading major production rag talk workloads
More from www.youtube.com / MLOps.community
Leading Enterprise Data Teams // Sol Rashidi // MLOps Podcast #227
4 days, 16 hours ago |
www.youtube.com
Jobs in AI, ML, Big Data
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne
Senior Machine Learning Engineer (MLOps)
@ Promaton | Remote, Europe
MLOps Engineer - Hybrid Intelligence
@ Capgemini | Madrid, M, ES
Analista de Business Intelligence (Industry Insights)
@ NielsenIQ | Cotia, Brazil