The Real E2E RAG Stack // Sam Bean // MLOps Podcast #217 | allainews.com

March 8, 2024, 6:19 p.m. | MLOps.community

MLOps.community www.youtube.com

MLOps podcast #217 with Sam Bean, Software Engineer (Applied AI) at Rewind.ai, The Real E2E RAG Stack.

// Abstract
What does a fully operational LLM + Search stack look like when you're running your own retrieval and inference infrastructure? What does the flywheel really mean for RAG applications? How do you maintain the quality of your responses? How do you prune/dedupe documents to maintain your document quality?

// Bio
Sam has been training, evaluating, and deploying production-grade inference solutions for …

abstract applications applied ai e2e engineer flywheel inference infrastructure llm look mean mlops mlops podcast podcast rag retrieval running sam search software software engineer stack

More from www.youtube.com / MLOps.community

What is AI Quality? // Mohamed Elgendy // MLOps Podcast #229 1 day, 10 hours ago | www.youtube.com

abstract ceo co-founder concept +11

AI's Struggle with Abstraction in Analogies // Shane Morris // MLOps podcast #223 clip 2 days, 11 hours ago | www.youtube.com

abstract automation autonomous autonomous systems +19

The Mind Behind the AI Coding Assistant // Peter Guagenti // MLOps podcast #222 clip 3 days, 11 hours ago | www.youtube.com

ai coding ai coding assistant assistant business +20

Streamlining Model Deployment // Daniel Lenton // AI in Production Talk 3 days, 15 hours ago | www.youtube.com

abstract aiaas ai companies ai infrastructure +21

LLMOps and GenAI at Enterprise Scale - Challenges and Opportunities // Andy McMahon // AI … 3 days, 15 hours ago | www.youtube.com

abstract andy challenges development +17

Data Labeling Best Practices // Charles Brecque // AI in Production Conference Lightning Talk 3 days, 15 hours ago | www.youtube.com

abstract best practices bio conference +17

Explaining ChatGPT to Anyone in 10 Minutes // Cameron Wolfe // AI in Production Conference 3 days, 15 hours ago | www.youtube.com

abstract become chatgpt conference +13

Handling Multi-Terabyte LLM Checkpoints // Simon Karasik // MLOps Podcast #228 4 days, 10 hours ago | www.youtube.com

abstract big cloud cloud storage +15

Leading Enterprise Data Teams // Sol Rashidi // MLOps Podcast #227 1 week, 1 day ago | www.youtube.com

abstract building cases ceo +20

Founding AI Engineer, Agents

@ Occam AI | New York

View on ai-jobs.net

AI Engineer Intern, Agents

@ Occam AI | US

View on ai-jobs.net

AI Research Scientist

@ Vara | Berlin, Germany and Remote

View on ai-jobs.net

Data Architect

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Data ETL Engineer

@ University of Texas at Austin | Austin, TX

View on ai-jobs.net

Lead GNSS Data Scientist

@ Lurra Systems | Melbourne

View on ai-jobs.net