all AI news
The Real E2E RAG Stack // Sam Bean // MLOps Podcast #217
March 8, 2024, 6:19 p.m. | MLOps.community
MLOps.community www.youtube.com
// Abstract
What does a fully operational LLM + Search stack look like when you're running your own retrieval and inference infrastructure? What does the flywheel really mean for RAG applications? How do you maintain the quality of your responses? How do you prune/dedupe documents to maintain your document quality?
// Bio
Sam has been training, evaluating, and deploying production-grade inference solutions for …
abstract applications applied ai e2e engineer flywheel inference infrastructure llm look mean mlops mlops podcast podcast rag retrieval running sam search software software engineer stack
More from www.youtube.com / MLOps.community
Jobs in AI, ML, Big Data
Founding AI Engineer, Agents
@ Occam AI | New York
AI Engineer Intern, Agents
@ Occam AI | US
AI Research Scientist
@ Vara | Berlin, Germany and Remote
Data Architect
@ University of Texas at Austin | Austin, TX
Data ETL Engineer
@ University of Texas at Austin | Austin, TX
Lead GNSS Data Scientist
@ Lurra Systems | Melbourne