March 8, 2024, 6:19 p.m. | MLOps.community

MLOps.community www.youtube.com

MLOps podcast #217 with Sam Bean, Software Engineer (Applied AI) at Rewind.ai, The Real E2E RAG Stack.

// Abstract
What does a fully operational LLM + Search stack look like when you're running your own retrieval and inference infrastructure? What does the flywheel really mean for RAG applications? How do you maintain the quality of your responses? How do you prune/dedupe documents to maintain your document quality?

// Bio
Sam has been training, evaluating, and deploying production-grade inference solutions for …

abstract applications applied ai e2e engineer flywheel inference infrastructure llm look mean mlops mlops podcast podcast rag retrieval running sam search software software engineer stack

More from www.youtube.com / MLOps.community

Software Engineer for AI Training Data (School Specific)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Python)

@ G2i Inc | Remote

Software Engineer for AI Training Data (Tier 2)

@ G2i Inc | Remote

Data Engineer

@ Lemon.io | Remote: Europe, LATAM, Canada, UK, Asia, Oceania

Artificial Intelligence – Bioinformatic Expert

@ University of Texas Medical Branch | Galveston, TX

Lead Developer (AI)

@ Cere Network | San Francisco, US