Preemption Chaos and Optimizing Server Startup // Bradley Heilbrun // LLMs in Prod Conference Part 2
Aug. 17, 2023, 1:09 p.m. | MLOps.community
GPU-enabled hosts are a significant driver of cloud costs for teams serving LLMs in production. Preemptible instances can deliver substantial savings but generally aren't suited to highly available services. This lightning talk tells the story of how Replit switched to preemptible GKE nodes, tamed the ensuing chaos, and saved buckets of cash while improving uptime.
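The approach the abstract describes, running GPU workloads on preemptible GKE capacity, can be sketched with the gcloud CLI. This is a minimal, hypothetical example: the cluster name, pool name, machine type, and accelerator choice are illustrative assumptions, not Replit's actual configuration.

```shell
# Hedged sketch: create a Spot (the successor to preemptible) GPU node pool
# on an existing GKE cluster. All names below are illustrative assumptions.
gcloud container node-pools create gpu-spot-pool \
  --cluster=llm-cluster \
  --spot \
  --machine-type=n1-standard-8 \
  --accelerator=type=nvidia-tesla-t4,count=1 \
  --num-nodes=2
```

Spot nodes can be reclaimed with only a brief shutdown notice, which is the "chaos" the talk refers to: serving processes need fast startup and graceful SIGTERM handling so replacement pods become ready before availability suffers.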
// Bio
Replit engineer focused on reliable and scalable LLM infrastructure. Formerly YouTube's first SRE, a longtime Googler, and an early PayPal Linux engineer.